


PROCEEDINGS 


OF THE 


NATIONAL ACADEMY OF SCIENCES 


THE ASSIMILATION OF AMMONIUM NITROGEN BY 
NITZSCHIA CLOSTERIUM AND OTHER MARINE 
PHYTOPLANKTON 


By CLAUDE E. ZOBELL 
SCRIPPS INSTITUTION OF OCEANOGRAPHY, UNIVERSITY OF CALIFORNIA, LA JOLLA, CALIF. 


Communicated July 1, 1935 


The origin of ammonia in the sea is a fairly well understood process 
but little is known concerning the formation of nitrate. It is the con- 
sensus of opinion of most oceanographers that nitrate nitrogen is essential 
for the production of marine vegetation, although experiments during the 
last two decades of many investigators (cf. reviews by Allison, 1931, and 
Pardo, 1935) have established that a large number of terrestrial plants 
assimilate ammonium nitrogen. Brandt (1929), Thompson and Robinson 
(1932), and others point out that oceanographic data including chemical 
analyses fail to indicate the utilization of ammonium nitrogen by phyto- 
plankton. However, in discussing a more satisfactory method for the 
quantitative estimation of ammonia in sea water, Wattenberg (1929) 
suggests that it is advisable to devote more attention to this form of nitro- 
gen as a possible nutrient for marine flora. Cooper’s (1933) extensive 
investigations in the English Channel indicate that ammonia may be 
directly utilized by phytoplankton. The following results furnish proof 
that certain bacteria-free cultures of phytoplankton do assimilate ammo- 
nium nitrogen. 

Experimental.—The organisms were cultured in aged sea water to 
which was added 1.0 cc. per liter of a mineral solution essentially the same 
as that recommended by Allen (1913) plus a little silicate: 


Na2HPO4·12H2O       5.0 gm. 
FeCl3·6H2O           .0 gm. 
Na2SiO3·9H2O         .0 gm. 
HCl (conc.)         2.0 cc. 
Distilled water   100.0 cc. 


In agreement with Harvey (1933), it was found that the silicate improved the growth 
of the diatoms and sometimes it became a limiting factor when sea water 











518 BOTANY: C. E. ZOBELL Proc. N. A. S. 


was not enriched with it. The medium was heated to 70°C., cooled 
and the clear supernatant fluid aseptically decanted into sterile Pyrex 
Erlenmeyer flasks. Sterile solutions of various ammonium, nitrite and 
nitrate salts were added in different concentrations. 

Most of the results reported below were obtained with a pure bacteria- 
free culture of Nitzschia closterium. Locally isolated cultures of Nitzschia 
bilobata, Navicula sp., Chlorella sp. and certain mixed cultures were also 
used. The test media were inoculated with uniform suspensions of the 
cultures to give approximately 500 cells per cc. after which they were 
exposed to indirect sunlight in north windows. The comparative rates 
of multiplication were followed by noting the first day on which growth 
became macroscopically visible and estimating the relative turbidity 
each day thereafter. Attempts were made to count the number of cells 
per unit of solution but their adherence to the glass and their tendency 
to form tenacious clumps under certain conditions interfered with direct 
microscopic enumeration. Frequent quantitative determinations of the 


TABLE 1 
COMPARATIVE ASSIMILATION OF AMMONIUM, NITRITE AND NITRATE BY N. closterium AS 
INDICATED BY GROWTH AND THE DISAPPEARANCE OF THE NITROGEN COMPOUNDS 
STARTING WITH 0.20 MILLIMOL OF EACH 

TIME IN      AMMONIUM           NITRITE            NITRATE 
DAYS       GROWTH   NH4       GROWTH   NO2       GROWTH   NO3 
  0          —      0.20        —      0.20        —      0.20 
  4          T      0.19        ?      0.20        —      0.18 
  7          +      0.19        T      0.20        ?      0.16 
 10          ?      0.17        ?      0.18        +      0.16 
 14          ++     0.16        ++     0.17        ?      0.15 
 18          +++    0.12        ?      0.15        +++    0.14 
 21          ?      0.11        ++     0.15        +++    0.10 
 28          +++    0.06        +++    0.09        ?      0.03 
 35          ++++   0.01        +++    0.04        ++++   0.001 
 42          ++++   0           ?      0           ?      0 

— no growth, T trace, + to ++++ increasing amounts of growth, ? entry illegible. 


nitrogen compounds furnished another accurate index to the activity of 
the plankton in the different media. Nitrite was determined with the 
sulphanilic acid dimethyl-α-naphthylamine reagent, ammonia with 
Wattenberg’s modified Nessler’s solution and nitrate with diphenylbenzidine. 
Table 1 illustrates the kind of record which was kept and shows that 
bacteria-free cultures of N. closterium assimilate ammonium and in so 
doing multiply more rapidly at first than in corresponding media con- 
taining equi-molar concentrations of either nitrite or nitrate. Simi- 
larly Nitzschia bilobata, Navicula sp., Chlorella sp. and certain mixed 
cultures of phytoplankton flourished on ammonium nitrogen, quanti- 
tatively exhausting as much as 0.2 millimol of this nitrogen compound 
















Vol. 21, 1935 BOTANY: C. E. ZOBELL 519 


from the sea-water solutions. There were no macroscopic signs of multi- 
plication in the controls to which no nitrogen was added although micro- 
scopic examinations revealed a two- to four-fold increase in the number 
of cells, probably attributable to a small quantity of residual available 
nitrogen in the sea water. Later experiments with synthetic sea-water 
controls demonstrated that the diatoms multiply with ammonium as the 
only source of nitrogen. 
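
The disappearance figures in Table 1 can be summarized with a few lines of arithmetic. The following is only an illustrative sketch: the concentrations are transcribed from Table 1 and stored as thousandths of a millimol, and the function and variable names are hypothetical. It tracks disappearance of the nitrogen compounds only, not the growth estimates.

```python
# Concentrations transcribed from Table 1, in thousandths of a millimol
# (integers are used to avoid floating-point comparisons).
DAYS = [0, 4, 7, 10, 14, 18, 21, 28, 35, 42]
REMAINING = {
    "ammonium": [200, 190, 190, 170, 160, 120, 110, 60, 10, 0],
    "nitrite":  [200, 200, 200, 180, 170, 150, 150, 90, 40, 0],
    "nitrate":  [200, 180, 160, 160, 150, 140, 100, 30, 1, 0],
}


def first_day_at_or_below(series, threshold):
    """Return the first sampling day on which the concentration is <= threshold."""
    for day, value in zip(DAYS, series):
        if value <= threshold:
            return day
    return None


def summarize():
    """For each nitrogen form, report when half and when all of it had disappeared."""
    summary = {}
    for form, series in REMAINING.items():
        summary[form] = {
            "half_depleted_by": first_day_at_or_below(series, series[0] // 2),
            "exhausted_by": first_day_at_or_below(series, 0),
        }
    return summary


if __name__ == "__main__":
    for form, info in summarize().items():
        print(form, info)
```

The sampling days are unevenly spaced, so the reported days are upper bounds on when each threshold was actually crossed.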

Provided small concentrations (0.2 millimol or less) of ammonium, 
nitrite and nitrate are compared, in virtually every instance multiplication 
and utilization first becomes manifest with ammonium. The initial 
superiority of ammonium vanishes in a week or two after which time 
there is little or no difference in the quantity of vegetation. The addition 
of a little ammonium to either nitrite or nitrate media accelerates multi- 
plication. Combinations of ammonium and nitrate compounds are found 
to be better than either ion used separately. Ammonium nitrate in con- 
centrations up to 0.1 millimol gave good results. The chloride, hy- 


TABLE 2 
OPTIMUM AND INHIBITORY CONCENTRATIONS OF AMMONIUM, NITRITE AND NITRATE FOR 
THE MULTIPLICATION OF N. closterium AND THE CONCENTRATION OF THESE IONS 
REPORTED FOUND IN THE SEA 

FORM OF             MILLIMOLS              MILLIMOLS IN SEA WATER 
NITROGEN       OPTIMUM     INHIBITORY      AVERAGE        RANGE 
Ammonium      0.01-0.05         4        0.001-0.005      0-0.02 
Nitrite       0.05-5.00        25        0.0001           0-0.001 
Nitrate       0.10-40.0       300        0.002-0.01       0-0.05 


droxide, sulfate, phosphate or carbonate of ammonium supplied the 
nitrogen requirements of the diatoms. 

Undoubtedly some of the reported failures of phytoplankton and other 
plants to grow in ammonium media are due to the use of excessive quan- 
tities of ammonium. The effect of the concentration of ammonium, 
nitrate and nitrite nitrogen was studied by inoculating sea-water media 
containing the nitrogen compounds in different combinations with diatoms. 
The optimum concentration for multiplication was accepted as that in 
which in the majority of the cases vegetation first became perceptible to 
the naked eye and not necessarily that in which growth was best after 
prolonged incubation. The results for N. closterium are presented in 
table 2 which also expresses the quantity of these ions reported to be 
found in the sea. It will be observed that the optimum concentration of 
nitrate is over a hundred times greater than that for ammonium and also 
the optimum concentration of each nitrogen compound is greatly in 
excess of its average concentration found in the sea. In media which 
contained 0.002 millimol of either ammonium or nitrate nitrogen, ap- 
proximately the average concentration which occurs in the sea, micro- 










scopic examinations revealed appreciable multiplication with ammonium 
being superior at first, but later there was no detectable difference in the 
number of cells in the two substrata. 

Tiedjens and Robbins (1931) confirm the findings of several other 
investigators that many crop plants preferentially assimilate ammonium 
at pH 7.0 or above whereas in acid media nitrate is more available, and 
at pH 4.0 ammonium is not assimilated. Efforts were made to ascertain 
the effect of the hydrogen-ion concentration upon the assimilation of 
ammonium by the marine phytoplankton. Dilute sodium hydroxide or 
hydrochloric acid was added to the sea-water media to give reactions 

















FIGURE 1 
[Graph not reproduced. Horizontal axis: days (5 to 30). Left scale: 0.01 to 0.06. Right scale: growth index (2 to 10).] 


Depletion of nitrate (dotted line) and ammonium nitrogen (dashed line) accom- 
panying the multiplication of N. closterium in sea-water medium which initially 
contained a mixture of 0.05 millimol of each. The appearance and disappearance of 
nitrite (dot-dash line) and the cumulation of vegetation (solid line) are also shown. 


ranging by increments of 0.4 from pH 6.0 to 9.0. The most acid medium 
in which N. closterium grew was pH 6.8 but the growth was so scant and 
abnormal that it was impossible to differentiate between the results with 
nitrate and ammonium. Since both forms of nitrogen supported multi- 
plication in more alkaline media up to and including pH 8.8, these values 
are accepted as the limits. The optimum was found to be pH 7.6 to 
8.0 although the sea-water habitat of these organisms is pH 8.0 to 8.4. 
There were no consistent differences in the assimilation of ammonium 
and nitrate at the different reactions. 

Evidence for the preferential assimilation of ammonium in the presence 


















of nitrate is presented graphically in figure 1. These data were obtained 
by inoculating media containing 0.05 millimol each of ammonium and 
nitrate with pure cultures of N. closterium. At various intervals the 
amount of each ion and of vegetation was determined. Under these 
conditions the ammonium was depleted from the solution several days 
before the nitrate. Special attention is called to the appearance of a trace 
of nitrites. 

Nitrites have been detected in small quantities in nitrate media seeded 
with Nitzschia closterium, N. bilobata, Navicula sp., Chlorella sp. and several 
unidentified mixed cultures of phytoplankton under conditions which are 
unfavorable to the activity of denitrifying bacteria. Depending primarily 
upon the size of the inoculum, a trace of nitrite begins to appear within 
three to seven days. The addition of a little ammonium to nitrate media 
usually increases the amount of nitrite which accumulates. Inasmuch 
as nitrite does not occur in ammonium medium lacking nitrate and does 
occur in nitrate media lacking ammonium, it is assumed that the diatoms 
reduce the nitrate to nitrite. During the initial positive acceleration 
growth phase the concentration of nitrite reaches its maximum and de- 
creases with fluctuations during the logarithmic phase of growth. Fi- 
nally the nitrite is exhausted at about the same time as the nitrate but due 
to the difficulties of determining nitrate in the presence of nitrite, it has 
not yet been possible to ascertain which disappears first or if they are 
simultaneously depleted from solution by the diatoms. It may be signifi- 
cant that in nitrate media teeming with multiplying diatoms, the con- 
centration of nitrite is usually greatest early in the morning and diminishes 
with exposure to light. A few million diatoms per cc. were found to reduce 
a little nitrate to nitrite in total darkness and it is reiterated that rigorous 
cultural as well as direct microscopic tests proved the absence of bacteria. 

Conclusions.—It is believed that the diatoms extracellularly reduce 
nitrate to nitrite in the process of its assimilation and eventual conversion 
into organic nitrogen. Beckwith (1933) demonstrated that certain pure 
cultures of Chlorella reduced nitrate to nitrite and that they assimilated 
the nitrite. Eckerson (1924) and others have presented experimental 
evidence to show that during protein synthesis by certain higher green 
plants nitrate is reduced to nitrite and then to ammonia. In view of the 
fact that protoplasmic nitrogen is in a reduced state, theoretically nitrite 
or ammonium nitrogen should be more readily available than nitrate 
nitrogen. It does not seem logical that all plant nitrogen must go through 
the roundabout process of being completely oxidized to nitrate only to 
be reduced again for conversion into amino nitrogen of protoplasm. It 
is more credible that particularly in the sea where the economy of Nature 
is at its best there is a short-cut from this part of the nitrogen cycle with 
nitrification serving primarily as a regulatory or storage process. Cooper 










(1933) has shown that Biddulphia mobiliensis assimilates ammonia from 
sea water. According to Schreiber (1927) more Carteria cells were pro- 
duced from ammonium than from an equivalent of nitrate. Harvey 
(1933) found that Nitzschia closterium grew just about as well in ammonium 
as in nitrate. Pure cultures of Chlamydomonas utilized ammonium more 
efficiently than nitrate in experiments described by Braarud and Föyn 
(1931) and Bond (1933) found that ammonium nitrogen satisfied the 
requirements of Prototheca zopfii, Polytoma uvella, Chlamydomonas 
pulvinata and Dunaliella salina. 

Acknowledgment is here made to Dr. H. A. Barker of Hopkins Marine 
Station and to Dr. E. E. Cupp of Scripps Institution for furnishing cultures 
of diatoms and for giving helpful advice and assistance. 


Allison, F. E., “Forms of Nitrogen Assimilated by Plants,” Quart. Rev. Biol., 6, 
313-321 (1931). 

Beckwith, T. D., “Metabolic Studies upon Certain Chlorellas and Allied Forms,” 
Univ. of Calif. at L. A. Publ. in Biol. Sci., 1, 1-34 (1933). 

Bond, R. M., “A Contribution to the Study of the Natural Food-Cycle in Aquatic 
Environments,” Bull. Bingham Oceanogr. Coll., 4, 1-89 (1933). 

Braarud, T., and Föyn, B., “Beiträge zur Kenntnis des Stoffwechsels im Meere,” 
Avhandl. Norske Videnskaps-Akademi i Oslo, 14, 1-24 (1931). 

Brandt, K., “Stickstoffverbindungen im Meere,” Wiss. Meeresuntersuchungen, 20, 
203-292 (1927). 

Cooper, L. H. N., “Chemical Constituents of Biological Importance in the English 
Channel November, 1930, to January, 1932,” Jour. Mar. Biol. Assoc., 18, 677-728 
(1933). 

Eckerson, S. H., “Protein Synthesis by Plants. I. Nitrate Reduction,” Bot. Gaz., 77, 
377-390 (1924). 

Harvey, H. W., “On the Rate of Diatom Growth,” Jour. Mar. Biol. Assoc., 19, 
253-276 (1933). 

Pardo, J. H., “Ammonium in the Nutrition of Higher Green Plants,” Quart. Rev. 
Biol., 10, 1-31 (1935). 

Schreiber, E., “Die Reinkultur von marinem Phytoplankton und deren Bedeutung 
für die Erforschung der Produktionsfähigkeit des Meerwassers,” Wiss. 
Meeresuntersuchungen, 16, 1-35 (1927). 

Thompson, T. G., and Robinson, R. J., “Chemistry of the Sea,” Bull. Nat. Res. 
Coun., Oceanogr., 5, 95-203 (1932). 

Tiedjens, V. A., and Robbins, W. R., “The Use of Ammonia and Nitrate Nitrogen 
by Certain Crop Plants,” N. J. Agric. Exper. Sta. Bull., 526, 1-46 (1931). 

Wattenberg, H., “A Simple Method for the Direct Estimation of Ammonia in Sea 
Water by the Use of Nessler’s Reagent,” Rapp. Proc. Verb., 53, 108-114 (1929). 
















Vol. 21, 1935 MATHEMATICS: G. A. MILLER 523 


GROUPS IN WHICH THE SQUARES GENERATE A SUBGROUP 
OF INDEX LESS THAN SEVEN 


By G. A. MILLER 


DEPARTMENT OF MATHEMATICS, UNIVERSITY OF ILLINOIS 


Communicated July 27, 1935 


The squares of the operators of a given group G generate an invariant 
subgroup H and the quotient group G/H is an abelian group of order 
$2^\alpha$ and of type (1, 1, 1, ...). If a Sylow subgroup of order $2^m$ contained 
in G is cyclic but not the identity then G contains one and only one 
subgroup of index 2 and hence this subgroup is generated by the squares of 
its operators. The fact that such a G contains at least one subgroup of 
index 2 is a direct consequence of the obvious existence of negative 
substitutions therein when it is represented as a regular substitution group. 
It could not contain more than one such subgroup since the cross-cut of 
all of its subgroups of index 2 is invariant and gives rise to an abelian 
quotient group of order $2^\alpha$ and of type (1, 1, 1, ...). This invariant 
subgroup is evidently generated by the squares of the operators of G since 
these squares appear in every subgroup of index 2. 
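
The statements above can be checked by brute force on small examples. The sketch below is an illustration only, not part of the original argument: it represents group elements as permutation tuples, and all function names are hypothetical. It computes the subgroup generated by the squares for the symmetric group of degree 4 and for the cyclic group of order 8.

```python
from itertools import permutations


def compose(p, q):
    """Return the permutation 'apply q first, then p' (permutations as tuples)."""
    return tuple(p[q[i]] for i in range(len(p)))


def generated_subgroup(generators, degree):
    """Close the generators under composition; in a finite group this
    closure is the subgroup they generate."""
    identity = tuple(range(degree))
    elements = {identity}
    frontier = [identity]
    while frontier:
        current = frontier.pop()
        for g in generators:
            product = compose(g, current)
            if product not in elements:
                elements.add(product)
                frontier.append(product)
    return elements


def squares_subgroup(group, degree):
    """Subgroup generated by the squares of all elements of the group."""
    squares = {compose(g, g) for g in group}
    return generated_subgroup(squares, degree)


# Symmetric group of degree 4: non-cyclic Sylow 2-subgroup, one subgroup of index 2.
symmetric_4 = set(permutations(range(4)))
h_s4 = squares_subgroup(symmetric_4, 4)
index_s4 = len(symmetric_4) // len(h_s4)

# Cyclic group of order 8: the squares generate the subgroup of index 2.
cyclic_8 = generated_subgroup({(1, 2, 3, 4, 5, 6, 7, 0)}, 8)
index_c8 = len(cyclic_8) // len(squares_subgroup(cyclic_8, 8))

if __name__ == "__main__":
    print("S4: index of the subgroup of squares =", index_s4)
    print("C8: index of the subgroup of squares =", index_c8)
```

In the symmetric case the subgroup found has order 12, which is the alternating group of degree 4.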

The fact that there are groups which separately involve only one 
subgroup of index 2 but do not contain a cyclic Sylow subgroup of order $2^m$ 
results directly from the symmetric groups of degree m > 3, since each of 
these groups is known to contain only one subgroup of index 2 but its 
Sylow subgroup of order $2^m$ is non-cyclic. The direct product of a simple 
group whose order exceeds 2 and the group of order 2 contains one and 
only one subgroup of index 2 and hence this subgroup is generated by the 
squares of its operators. Moreover, if such a simple group is extended by 
an operator so as to obtain a group of twice its order and if this extending 
operator transforms it according to an inner isomorphism then this 
extended group is such a direct product. That is, if a group contains a 
simple group whose order exceeds 2 as a subgroup of index 2 generated by 
the squares of its operators but is not the direct product of this simple group 
and a group of order 2 then it is the extension of this simple group and an 
operator which transforms it according to an outer isomorphism whose order 
is a power of 2. 

A necessary and sufficient condition that the squares of the operators 
of a group of order $2^m$ generate a subgroup of index 2 is that this group 
is cyclic. In a non-cyclic group of order $2^m$ the subgroup generated by 
the squares of its operators therefore has an index which is at least 4. We 
proceed to determine some of the properties of those groups in which this 
index is exactly 4. A necessary and sufficient condition that this index 
is exactly 4 is that the group G contains exactly three subgroups of index 2. 










Another necessary and sufficient condition is that G contains a set of two 
independent generators and hence its φ-subgroup is of index 4. We shall 
now prove that no one of the three subgroups of index 2 contained in G 
has more than three independent generators. 

To prove this fact we shall first prove that G cannot contain an abelian 
subgroup of type (1, 1, 1, ...) which contains as many as four independent 
generators. If such a subgroup appeared in G an additional operator of 
G would transform all the commutators arising from it into themselves 
and hence the order of the commutator subgroup could not exceed the 
square root of the order of this abelian subgroup. The squares of the 
additional operators of G would also appear in the central of G. Hence 
the commutators of G and the squares of the operators of G would generate 
a characteristic subgroup of G whose order could not exceed the double of 
the order of this commutator subgroup and hence it would be at least of 
index 8 under G. Consequently G would involve at least seven subgroups 
of index 2. As this is contrary to the hypothesis it has been proved that 
G cannot contain an abelian subgroup of index 2 and of type (1, 1, 1, ...) 
which has as many as four independent generators. 

If G involved another subgroup of index 2 containing at least as many 
as four independent generators then the squares of the operators of this 
subgroup would generate a characteristic subgroup of it which would 
give rise to an abelian quotient group of type (1, 1, 1, ...) having at least 
as many as four independent generators. The reasons why this is 
impossible are obviously similar to those noted in the preceding 
paragraph and hence there results the following theorem: If a group of order 
$2^m$ has the property that the squares of its operators generate a subgroup whose 
index does not exceed 4 then it cannot involve a subgroup of index 2 which has 
as many as four independent generators. It therefore results that if in a 
group of order $2^m$ the squares of the operators generate a subgroup whose 
index does not exceed 4 then it involves no subgroup of index 2 which has 
more than three independent generators. 

It is well known that when G involves a cyclic subgroup of index 2 there 
are five and only five groups of order $2^m$, m > 3, in which the squares of 
the operators generate a subgroup which is exactly of index 4. Two of 
these groups involve separately two cyclic subgroups of index 2 while each 
of the remaining three involves only one such subgroup. In the latter 
cases each of the two additional subgroups of index 2 has two independent 
generators. Every quotient group of the groups under consideration has 
at most two independent generators. In particular, the possible non- 
cyclic quotient groups of order 8 are the quaternion group, the octic group 
and the abelian group of type (2,1). 
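
For the groups of order 8 the relation between the index of the subgroup of squares and the number of subgroups of index 2 can be tabulated directly. The following sketch is illustrative only: the permutation generators chosen for each group and all function names are assumptions of this example. The pairwise search for subgroups of index 2 is adequate here only because every subgroup of order 4 has at most two generators.

```python
def compose(p, q):
    """Return the permutation 'apply q first, then p' (permutations as tuples)."""
    return tuple(p[q[i]] for i in range(len(p)))


def generated_subgroup(generators, degree):
    """Close the generators under composition (finite group, so this is a subgroup)."""
    identity = tuple(range(degree))
    elements = {identity}
    frontier = [identity]
    while frontier:
        current = frontier.pop()
        for g in generators:
            product = compose(g, current)
            if product not in elements:
                elements.add(product)
                frontier.append(product)
    return elements


def squares_index(group, degree):
    """Index of the subgroup generated by the squares of the elements."""
    squares = {compose(g, g) for g in group}
    return len(group) // len(generated_subgroup(squares, degree))


def count_index_two_subgroups(group, degree):
    """Count subgroups of index 2 by generating from every pair of elements."""
    half = len(group) // 2
    found = set()
    for a in group:
        for b in group:
            candidate = generated_subgroup({a, b}, degree)
            if len(candidate) == half:
                found.add(frozenset(candidate))
    return len(found)


# (degree, generators); the quaternion group is given by its left regular
# representation on the elements ordered 1, i, -1, -i, j, k, -j, -k.
GROUPS = {
    "quaternion": (8, [(1, 2, 3, 0, 5, 6, 7, 4), (4, 7, 6, 5, 2, 1, 0, 3)]),
    "octic": (4, [(1, 2, 3, 0), (0, 3, 2, 1)]),
    "abelian type (2,1)": (6, [(1, 2, 3, 0, 4, 5), (0, 1, 2, 3, 5, 4)]),
    "cyclic": (8, [(1, 2, 3, 4, 5, 6, 7, 0)]),
    "abelian type (1,1,1)": (6, [(1, 0, 2, 3, 4, 5), (0, 1, 3, 2, 4, 5),
                                 (0, 1, 2, 3, 5, 4)]),
}


def survey():
    """Return {name: (order, index of squares subgroup, number of index-2 subgroups)}."""
    results = {}
    for name, (degree, generators) in GROUPS.items():
        group = generated_subgroup(generators, degree)
        results[name] = (len(group), squares_index(group, degree),
                         count_index_two_subgroups(group, degree))
    return results


if __name__ == "__main__":
    for name, row in survey().items():
        print(name, row)
```

The three non-cyclic groups with two independent generators each give index 4 and three subgroups of index 2, while the group of type (1, 1, 1) gives index 8 and seven such subgroups.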

To prove that the same G cannot have more than one of these groups of 
order 8 as a quotient group we may note that first if it had at least two 


















such quotient groups it would have at least three, which would not neces- 
sarily be distinct, since each such quotient group would involve a sub- 
group of order 2 corresponding to the subgroup of index 4 in G generated 
by the squares of its operators. Hence two invariant subgroups of index 8 
which would give rise to these quotient groups would have an invariant 
cross-cut of index 16 under G which would give rise to the four group as 
a quotient group under the subgroup of the squares in G. The squares 
of some of the remaining operators of G would appear in every pair of co- 
sets corresponding to operators of this four group. As this is obviously 
impossible it has been proved that a group of order $2^m$ in which the squares 
of the operators generate a subgroup of index 4 is completely characterized by 
its non-cyclic quotient group of order 8. 

It is easy to prove that there are infinite systems of groups belonging 
to each of the three groups of order 8 having two independent generators. 
To construct such a system belonging to the quaternion group we may 
start with an arbitrary abelian group having either two or three inde- 
pendent generators in the restricted sense of this term. In the former 
case we may assume that an additional operator which transforms the 
independent generator whose order is at least as large as the order of the 
other into itself multiplied by this other which it transforms into its in- 
verse. Hence it is commutative with the product of the square of the 
former independent generator into the latter and may be assumed to have 
this product for its square. To the identity of the quaternion quotient 
group there corresponds the group generated by the fourth power of the 
former, the square of the latter of the given independent generators, and 
the product of the latter into the square of the former. 

If the arbitrary abelian group has three independent generators $s_1$, $s_2$, 
$s_3$ we may assume that the order of $s_1$ does not exceed that of $s_2$ and that 
the additional operator has $s_3$ for its square, transforms $s_1$ into its inverse 
and transforms $s_2$ into itself multiplied by $s_1$. The subgroup corresponding 
to the identity of the quaternion quotient group is then generated by the 
squares of $s_1$ and $s_3$, by the fourth power of $s_2$ and by the products of $s_1$ into 
$s_2^2$ and $s_3$. To prove that there exist an infinite number of such groups 
which have either the octic group or the abelian group of type (2,1) as a 
quotient group we may note that in each of these cases it may be assumed 
that G contains an arbitrary abelian subgroup having either two or three 
independent generators. The arguments in each of these cases are similar 
to those employed above when the quotient group is the quaternion group. 








526 MATHEMATICS: A. D. MICHAL Proc. N. A. S. 


“RIEMANNIAN” DIFFERENTIAL GEOMETRY IN ABSTRACT 
SPACES¹ 


By A. D. MICHAL 
DEPARTMENT OF MATHEMATICS, CALIFORNIA INSTITUTE OF TECHNOLOGY 


Communicated July 26, 1935 


1. Introduction.—In this note brief indications are given of a set of 
postulates for ‘‘Riemannian” differential geometry in abstract spaces. 
The detailed treatment of this geometry has been included in a compre- 
hensive memoir that I intend to publish elsewhere. An introduction to 
more general abstract differential geometries with one or several linear 
connections has been given by me in another comprehensive memoir.² 

It has become clear in recent years that a differential geometry is con- 
cerned with several spaces together with their interconnections by means 
of interspace functional transformations. Moreover, the differentiability 
of at least one of these functional transformations enters into the theory 
in an essential manner. It is evident from these remarks that some of the 
spaces must be sufficiently restricted so as to be capable of supporting a 
theory of differentials.³ 

Let R be the class of real numbers, which we take as a known system.⁴ 
The universe of discourse of our abstract Riemannian geometry consists 
of two classes $E_1$ and $E_2$ of undefined elements, two relations of equality,⁵ 
two operations of addition, two operations of multiplication by elements 
of R, two norms, one interspace inner product [. , . ], two metric functions 
$g(.,.)$, $\bar{g}(.,.)$, two “parallelism” functions $g(.,.,.)$, $f(.,.,.)$ and two 
“curvature” functions $g(.,.,.,.)$, $f(.,.,.,.)$. The theory of this universe 
of discourse subject to the sixty postulates (classified into eight groups) 
with $[\delta x, g(x, \delta x)]^{1/2}$ as element of arc length constitutes our abstract 
Riemannian differential geometry. In the first comprehensive memoir 
referred to above we have stated the sixty postulates in such a manner 
as to make each postulate intelligible independently of the remaining ones. 
In order to avoid lengthy statements in the present note we have not done 
this for the thirty postulates that are explicitly given in section 2. 

The classes $E_1$ and $E_2$ have been shown to have all the properties of a 
complete normed vector space. The commutativity and associativity 
of each of the two additions, the usual three properties of each of the two 
equalities, the existence of the two zero elements and several continuity, 
differentiability and substitution properties are not postulated but proved 
as theorems. 

Besides the classical finite dimensional examples and the author’s 
infinitely dimensional instances,⁷ several new instances and specializations 
of the general abstract theory have been found. 


















2. The Postulates.— 


Group I: Postulates for $E_1$ 
Group II: Postulates for $E_2$ 
Group III: Postulates for interspace inner product $[\xi, \eta]$ 

(1) If $\xi \in E_1$, $\eta \in E_2$, then $[\xi, \eta] \in R$; (2) $[\xi_1 + \xi_2, \eta] = [\xi_1, \eta] + [\xi_2, \eta]$; (3) 
$[\xi, \eta_1 + \eta_2] = [\xi, \eta_1] + [\xi, \eta_2]$; (4) $|[\xi, \eta]| \le M \|\xi\| \|\eta\|$; (5) $[\xi, \eta] = 0$ for 
all $\xi$ implies $\eta + \eta_1 = \eta_1$ for all $\eta_1$; (6) $[\xi, \eta] = 0$ for all $\eta$ implies $\xi + \xi_1 = \xi_1$ 
for all $\xi_1$. 

Group IV: Postulates for metric $g(x, \xi)$ and its inverse $\bar{g}(x, \eta)$ 

(1) If $x \in E_1((x_0)_a)$, $\xi \in E_1$, then $g(x, \xi) \in E_2$; (2) $\|g(x, \xi)\| \le M_1 \|\xi\|$; (3) if 
$x_1 = x_2$, then $g(x_1, \xi) = g(x_2, \xi)$; (4) if $x \in E_1((x_0)_a)$, $\eta \in E_2$, then $\bar{g}(x, \eta) \in E_1$; 
(5) if $x_1 = x_2$, then $\bar{g}(x_1, \eta) = \bar{g}(x_2, \eta)$; (6) $g(x, \bar{g}(x, \eta)) = \eta$; (7) $\bar{g}(x, g(x, \xi)) = \xi$; 
(8) $[\xi_1, g(x, \xi_2)] = [\xi_2, g(x, \xi_1)]$; (9) $[\xi, g(x, \xi)] \ge 0$; (10) if $[\xi, g(x, \xi)] = 0$, 
then $\xi + \xi_1 = \xi_1$ for all $\xi_1$. 

Group V: First group of postulates for parallelism 

(1) If $x \in E_1((x_0)_a)$, $\xi, y \in E_1$, then $g(x, \xi, y) \in E_2$; (2) $g(x, \xi, y_1 + y_2) = 
g(x, \xi, y_1) + g(x, \xi, y_2)$; (3) given an $\epsilon > 0$ there exists a $\delta > 0$ such that 
$\|y\| < \delta$ implies $\|g(x, \xi, y)\| < \epsilon$; (4) given an $\epsilon > 0$ there exists a $\delta > 0$ 
such that $\|y\| < \delta$ implies $\|(g(x + y, \xi) + (-1)g(x, \xi)) + (-1)g(x, \xi, y)\| 
\le \epsilon \|y\|$; (5) if $x_1 = x_2$, then $g(x_1, \xi, y) = g(x_2, \xi, y)$; (6) if $\xi_1 = \xi_2$, then 
$g(x, \xi_1, y) = g(x, \xi_2, y)$; (7) if $y_1 = y_2$, then $g(x, \xi, y_1) = g(x, \xi, y_2)$. 

Group VI: Second group of postulates for parallelism 

(1) If $x \in E_1((x_0)_a)$, $\xi_1, \xi_2 \in E_1$, then $f(x, \xi_1, \xi_2) \in E_2$; (2) $[y, f(x, \xi_1, \xi_2)] = 
[\xi_1, g(x, \xi_2, y)]$; (3) given an $\epsilon > 0$ there exists a $\delta > 0$ such that $\|\xi_1\| < \delta$ 
implies $\|f(x, \xi_1, \xi_2)\| < \epsilon$. 

Group VII: First group of postulates for curvature 

(1) If $x \in E_1((x_0)_a)$, $\xi, y_1, y_2 \in E_1$, then $g(x, \xi, y_1, y_2) \in E_2$; ...; (4) given an 
$\epsilon > 0$ there exists a $\delta > 0$ such that $\|y_2\| < \delta$ implies $\|(g(x + y_2, \xi, y_1) + 
(-1)g(x, \xi, y_1)) + (-1)g(x, \xi, y_1, y_2)\| \le \epsilon \|y_2\|$; ... 

Group VIII: Second group of postulates for curvature 

(1) If $x \in E_1((x_0)_a)$, $\xi_1, \xi_2, y \in E_1$, then $f(x, \xi_1, \xi_2, y) \in E_2$; ...; (4) given an 
$\epsilon > 0$ there exists a $\delta > 0$ such that $\|y\| < \delta$ implies $\|(f(x + y, \xi_1, \xi_2) + 
(-1)f(x, \xi_1, \xi_2)) + (-1)f(x, \xi_1, \xi_2, y)\| \le \epsilon \|y\|$; .... 


3. Parallelism and Curvature.—Each function considered in this section 
has been shown to be continuous jointly in all its arguments and, except 
for the dependence on the variable x, linear in each argument. The 
differentials⁸ $g(x, \xi; \delta x)$, $g(x, \xi; \delta_1 x; \delta_2 x)$, $f(x, \xi_1, \xi_2; \delta x)$ exist equal to 
$g(x, \xi, \delta x)$, $g(x, \xi, \delta_1 x, \delta_2 x)$, $f(x, \xi_1, \xi_2, \delta x)$, respectively. 










The following two functions $\Gamma_1(x, \xi_1, \xi_2)$ and $\Gamma_2(x, \eta, \xi)$ play a rôle 
analogous to that of the Christoffel symbols of the second kind in 
n-dimensional classical Riemannian geometry. They are defined by 

$$\Gamma_1(x, \xi_1, \xi_2) = \tfrac{1}{2}\,\bar{g}\bigl(x,\; g(x, \xi_1; \xi_2) + g(x, \xi_2; \xi_1) - f(x, \xi_1, \xi_2)\bigr)$$
$$\Gamma_2(x, \eta, \xi) = \tfrac{1}{2}\,\bigl\{ g(x, \bar{g}(x, \eta); \xi) + f(x, \bar{g}(x, \eta), \xi) - g(x, \xi; \bar{g}(x, \eta)) \bigr\} \tag{1}$$

and have been shown to satisfy the identity 

$$[\Gamma_1(x, \xi_1, \xi_2), \eta] = [\xi_1, \Gamma_2(x, \eta, \xi_2)].$$

The “covariant” differential $g(x, \xi_1/\xi_2)$ defined by 

$$g(x, \xi_1/\xi_2) = g(x, \xi_1; \xi_2) - g(x, \Gamma_1(x, \xi_1, \xi_2)) - \Gamma_2(x, g(x, \xi_1), \xi_2)$$

vanishes. Moreover, the equation $g(x, \xi_1/\xi_2) = 0$ considered as a 
functional equation with the functions $\Gamma_1$ and $\Gamma_2$ as unknowns has the unique 
solution (1). With the aid of the functions $\Gamma_1$ and $\Gamma_2$ it is possible to 
develop a theory of parallel displacements and geodesics. 
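
The analogy with the Christoffel symbols can be made concrete in the classical n-dimensional case. The following is a sketch under stated assumptions, not a statement from the memoir: take $E_1 = E_2 = R^n$ with the inner product, metric, inverse metric (written $\bar{g}$ here), differential of the metric and function $f$ identified in coordinates as shown, with summation over repeated indices.

```latex
% Assumed coordinate identifications (illustrative only):
%   [\xi, \eta] = \xi^i \eta_i, \qquad
%   g(x,\xi)_i = g_{ij}(x)\,\xi^j, \qquad
%   \bar g(x,\eta)^i = g^{ij}(x)\,\eta_j,
%   g(x,\xi;y)_i = \partial_k g_{ij}\,\xi^j y^k, \qquad
%   f(x,\xi_1,\xi_2)_i = \partial_i g_{jk}\,\xi_1^j \xi_2^k .
\begin{align*}
\Gamma_1(x,\xi_1,\xi_2)^i
  &= \tfrac12\, g^{il}\bigl(\partial_k g_{lj} + \partial_j g_{lk} - \partial_l g_{jk}\bigr)\,\xi_1^j \xi_2^k
   = \Gamma^i_{jk}\,\xi_1^j \xi_2^k, \\
\Gamma_2(x,\eta,\xi)_j
  &= \tfrac12\, \eta^l \bigl(\partial_k g_{jl} + \partial_j g_{lk} - \partial_l g_{jk}\bigr)\,\xi^k
   = \Gamma^l_{jk}\,\eta_l\,\xi^k, \qquad \eta^l = g^{lm}\eta_m, \\
[\Gamma_1(x,\xi_1,\xi_2), \eta]
  &= \Gamma^i_{jk}\,\xi_1^j \xi_2^k\,\eta_i
   = [\xi_1, \Gamma_2(x,\eta,\xi_2)], \\
g(x,\xi_1/\xi_2)_i
  &= \bigl(\partial_k g_{ij} - g_{il}\Gamma^l_{jk} - g_{lj}\Gamma^l_{ik}\bigr)\,\xi_1^j \xi_2^k = 0 .
\end{align*}
```

Under these identifications the last line is the familiar statement that the covariant derivative of the metric tensor vanishes.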

Let 


$v_1(x, \xi_1, \xi_2) = g(x, \Gamma_1(x, \xi_1, \xi_2))$, $v_2(x, \xi_1, \xi_2) = \Gamma_2(x, g(x, \xi_1), \xi_2)$ 


then the function $R_1(x, \xi_1, \xi_2, \xi_3)$ defined as follows plays a rôle similar to 
the Riemann-Christoffel curvature tensor in classical Riemannian geometry 


Ri(x, £1, & &) = */o{ g(x, £2; &1; &) + f(x, &, &5 &) — f(x, &, f23 &) 
—g(x, t; 1; &)} + velx, g(x, s(x, &, &)), &) 
— yo(x, 2(x, vi(x, &, &)), &3). 


The following identities have been shown to hold for the curvature form $R_1$: 


Ri(x, &, bs, &3) =o Ri(x, &, §3, &), [é, Ri(x, &1, bo, £3) ] oa [&, Ri(x, g, £5, 
f)], [& Ri(x, &, &, &)] = [& Rilx, &, & &)], Rilx, &, &, &) + Rilx, 
&, £1, &) + Ri(x, &, &, &) = 0. 


The addition of a few postulates has enabled us to study abstract 
Riemannian geometry with torsion while an evident modification of the 
postulates has led to a geometry with abstract Hermitian differential 
metric. As an interesting by-product we mention the fact that a slight 
modification of the first six groups of postulates alone suffices for an 
abstract dynamics with $\tfrac{1}{2}[dx/dt, g(x, dx/dt)]$ as kinetic energy. 

1 Presented at the 1933 Pasadena meeting of the American Math. Soc. The present 
drastically reduced set of postulates was presented at the 1935 Stanford meeting of 
the Amer. Math. Soc. Cf. Bull. Am. Math. Soc., 39, 879 (Nov., 1933), and 41, 195 
(March, 1935). 

2 Presented at the 1933 Pasadena meeting of the Am. Math. Soc. Cf. Bull. Am. Math. 
Soc., 39, 880 (Nov., 1933). 

3 Fréchet, M., Ann. Sc. Ec. Norm. Sup., t. 62, 293-323 (1925). 
















Vol. 21, 1935 MATHEMATICS: E. FISHER 529 


4 A more general theory will result if in the place of R we employ an abstract normed 
ring. 

5 We use the symbols =, +, ||...||, ... in more than one sense. No confusion
need arise as the context makes clear the meaning of each such symbol. It is
worthwhile to mention here that the relation of equality = for E₁ as well as for E₂ is not an
independent primitive idea; for, an equivalent set of postulates can be given in which
the equality = for E₁ as well as for E₂ is defined by means of the properties of the
corresponding norm function ||...||.

6 Banach, S., Théorie des Opérations Linéaires, chapter IV, Warsaw, 1932; see also
Fréchet, M., Espaces Abstraits, Paris, 1928.

7 Michal, A. D., "Affinely Connected Function Space Manifolds," Am. Jour. Math.,
50, 473-517, especially 509-517 (1928); "Differential Geometries of Function Space,"
these PROCEEDINGS, 16, 88-94 (Jan., 1930). See also Peterson, T. S., "The Analogue
of Weyl's Conformal Curvature Tensor in a Michal Functional Geometry," Annali di
Mat., 13, 55-62 (1934).

8 The first partial differential with respect to x of a function $F(x, y_1, \ldots, y_n)$ is denoted
by $F(x, y_1, \ldots, y_n; \delta x)$. All differentials are taken in the Fréchet sense.


ASYMPTOTIC REPRESENTATIONS OF CONFLUENT HYPER- 
GEOMETRIC FUNCTIONS 


By E. FISHER 
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 


Communicated July 18, 1935 


In a number of problems in quantum mechanics there is need for simple
asymptotic approximations to the confluent hypergeometric function
$M_{k,m}(z)$. In the following note a combination of the recurrence and
differential properties of these functions has led to such expressions for
the cases $|z| \gg |k|, 1$; $|m| \gg |k|, 1$ and $|z| \gg |m|, 1$. The notation and
fundamental formulas used are from Whittaker and Watson, Modern
Analysis.

Case I. $|z| \gg |k|, 1$.

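If we let $F_{k,m}(z) = z^{-k}e^{z/2}W_{k,m}(z)$, the function F satisfies the two equations,

$$\frac{m^2 - (k - \tfrac12)^2}{z^2}\,F_{k-1,m} + \left(1 - \frac{2k}{z}\right)F_{k,m} = F_{k+1,m} \qquad (1)$$

$$\frac{d}{dz}F_{k,m} = \frac{(k - \tfrac12)^2 - m^2}{z^2}\,F_{k-1,m} \qquad (2)$$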


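From equation (1), neglecting $O(z^{-1})$, we obtain

$$0 = \frac{m^2}{z^2}F_{k-1,m} + F_{k,m} - F_{k+1,m} = -\left(\delta + \tfrac12 + \tfrac12 D\right)\left(\delta + \tfrac12 - \tfrac12 D\right)F_{k-1,m},$$

where $\delta F_{k,m} = F_{k+1,m} - F_{k,m}$ and $D^2 = 1 + 4m^2/z^2$. A general solution
of this difference equation is $c(m,z)\left(\tfrac12 + \tfrac12 D\right)^k + d(m,z)\left(\tfrac12 - \tfrac12 D\right)^k$.
However, since $F_{k,m} \to 1$ as $z \to \infty$, and since, when $R(k) < 0$, $\left(\tfrac12 - \tfrac12 D\right)^k \to \infty$ as $z \to \infty$,
the solution representing F must be

$$F_{k,m}(z) = c(m,z)\left[\left(\tfrac12 + \tfrac12 D\right)^k + O(z^{-1})\right]. \qquad (3)$$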


Because of the branch point in D and because of our use of $z = \infty$ as a
boundary point, we shall require that $|m| < |z/2|$, although this requirement
may be more stringent than is necessary.
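
The leading-order step above can be checked numerically. The following is a minimal sketch, assuming the readings $D^2 = 1 + 4m^2/z^2$ and a trial solution $F_k = r^k$ with $r = \tfrac12 + \tfrac12 D$; the function names are illustrative only and do not appear in the paper.

```python
import cmath


def leading_root(m, z):
    """Return D = sqrt(1 + 4 m^2 / z^2) and the root r = (1 + D) / 2."""
    D = cmath.sqrt(1 + 4 * m**2 / z**2)
    return D, (1 + D) / 2


def truncated_residual(k, m, z):
    """Residual of (m^2/z^2) F_{k-1} + F_k - F_{k+1} for the trial F_k = r**k.

    This is equation (1) with the O(1/z) terms dropped, as in the text.
    """
    _, r = leading_root(m, z)
    return (m**2 / z**2) * r ** (k - 1) + r**k - r ** (k + 1)


if __name__ == "__main__":
    # The residual should vanish up to rounding error.
    print(abs(truncated_residual(k=3, m=2.0, z=50.0)))
    # D tends to 1 for large |z|, so that r tends to 1 and F tends to a constant.
    print(leading_root(1.0, 1.0e6)[0])
```

The second root, $\tfrac12 - \tfrac12 D$, satisfies the same quadratic; it is discarded in the text on the grounds of the behavior at $z = \infty$, not because it fails the recurrence.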

Equation (3) may be written

$$\frac{F_{k-1,m}}{F_{k,m}} = \frac{2}{D+1}\left[1 + \epsilon_{k,m}(z)\right]; \qquad \epsilon = O(z^{-1}). \qquad (4)$$

Substituting back in equation (1), and neglecting $O(z^{-2})$, we have

$$0 = (D - 1)\,\epsilon_{k,m} - \frac{4k}{z} + (D + 1)\,\epsilon_{k+1,m};$$

or if we set

$$z\,\epsilon_{k,m} = \frac{2k}{D} - \frac{D+1}{D^2} + v_{k,m},$$

we have $(D - 1)v_{k,m} + (D + 1)v_{k+1,m} = 0$, so that $v_{k,m} = c(m,z)\,[(1 - D)/(1 + D)]^k$.
This term also is ruled out by the requirement that F
be finite at $z = \infty$, so that the first approximation is

$$\frac{F_{k-1,m}}{F_{k,m}} = \frac{2}{D+1} + \frac{2}{z}\left[\frac{2k}{D(D+1)} - \frac{1}{D^2}\right] + O(z^{-2}). \qquad (5)$$
By using $\epsilon_{k,m}(z)$ as $O(z^{-1})$, we now see, we have taken $1 + 4m^2/z^2 = O(1) > O(z^{-1})$.
Even so, if z is large enough, we shall still be able to cover most of
the range $m = 0$ to $m = iz/2$.


In a similar manner we can obtain a second approximation 


Fei, m _ 2 +2( 2k n)talptptpnpai— 
Fin D+1° #\D(D+1) D ae DD’ D D+1 





(D + 1)? (D+1)? D? Dé D™ Di 


1 1 a 
“ogi oe P™ ‘ # 













Turning now to equation (2), we write it 


inlFo,w(o)] =| m?— (#3) | fo” aa Fe (1/2) 


and, by utilizing (6), we obtain 
2im In[ Fy. m(z) ] ‘ | (; - Ay) ( e) 
’ = . 1 i ES aig hoe 
m? — (k — 1/2)? TT Ss ite dis 


a ule tan y 5 1 
= in| cos () see o| + - (x sec » — 5 tant — # 3) (7) 


“ 








where $\varphi = \sin^{-1}(2im/z)$. A third approximation adds to this solution
the terms


{a1 + 22k — 4k? + 4k) In cos (2)] — (7 + 2k + k?) In (cosy) + 


lear dais. Sa )) een I ceaag — — 2 
40 3° + (2 *) sec 9 + (: k =) sec Qg + 


 § 27 + 10k — 16k? 
dai: ae Lal 3 4 
(5 st) sec? g + 16 sec* g + 


27 1 13 \ 
vaio ‘ins Misa 5 bia?) eee 6 
(ia a) sec® o 16 °° Qe: 


To test this formula the special case of the Bessel function was computed. 





The above formulas give $\varphi = \sin^{-1}(n/x)$


n3 


° nt — 1/ 5 nd — 1/ 
(tan $ _ e) + ont (tan g+ 3 tan® e)} exp { Ont 


oe 11 27 
jin (cos : sec’ e) + 0 — z sec g-— 7 sec? g + 16 sect o — 


es! 
J,,(x) = \. (sec yg)” — 1/sn? cos {: (n + 1/9) —x+ didar i_ 








1 13 
peat 5 déenh conde 6 
- sec® 16 sec ol}. (8) 


The exponential factor is the improvement of the third approximation 
over the second. It can be neglected when x is large, or when n is con- 
siderably smaller than x. For instance, neglecting the exponential term, 
we obtain Ji(20) = 0.186, J9(100) = 0.140, and from Hayashi’s tables 
we see that Jio(20) = 0.186 and Jy;(100) = 0.137. In the following cases 
the third approximation was needed, however: Ji;(20) = 0.004, Jis(20) = 










0.23, J;(10) = 0.234, J;(10) = 0.29. From Hayashi we obtain 0.001, 
0.25, —0.234, 0.32, respectively. 
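
The tabulated value quoted above for the Bessel function of order 10 at argument 20 can be reproduced independently. The sketch below does not implement the asymptotic formula (8); it evaluates Bessel's integral $J_n(x) = \frac{1}{\pi}\int_0^\pi \cos(n\tau - x\sin\tau)\,d\tau$ for integer n by the trapezoidal rule, only to illustrate how such a table entry can be checked. The function name and step count are assumptions made for the example.

```python
import math


def bessel_j(n, x, steps=2000):
    """J_n(x) for integer n from Bessel's integral, trapezoidal rule on [0, pi]."""
    h = math.pi / steps
    # End-point values: cos(0) at tau = 0 and cos(n*pi) at tau = pi.
    total = 0.5 * (1.0 + math.cos(n * math.pi))
    for i in range(1, steps):
        t = i * h
        total += math.cos(n * t - x * math.sin(t))
    return total * h / math.pi


if __name__ == "__main__":
    # Compare with the table value 0.186 quoted in the text for order 10, x = 20.
    print(round(bessel_j(10, 20.0), 3))
```

Because the integrand is smooth and periodic, the trapezoidal rule converges very quickly here, so a modest number of steps is sufficient for three-figure comparisons.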
Case II. $|m| \gg |k|, 1$.

The function $N_{k,m}(z) = z^{-1/2-m}M_{k,m}(z)$ has the following properties











k? — (m + 1/2)? | 4mkz | 
2°Np ————— — 2m Np » 
(2m + 1)2(Qm +2)°° "™t! ‘te = m | Nem + 
2mNz, m -1 = 0 N,, (8) —> 1. (9) 


4 m 
If Ng, m(z) = (=) Qy, m(2), then, neglecting 0(m—'), equation (9) becomes 
4 4 4 
0 = Ouse t+— Ont1 — On = [a+ (2 +) +) Ow 
2m | 4m? 2m \ 4m? 
= (s414 4 y+ Se)(ot14+™ - 1+ 2) On 


9 


22 
4m? 





The last step is valid provided we require that 1 1+ > 0 (m-'). 


Of the two solutions of this difference equation, one has an infinity atm = 
o, and so the first approximation reduces to 


a (E — 1) (1 + en(z)); B=1t+ 


Z 


9 


4m? 


e = O(m™'). (10) 








(Me, m+1 / Nz, a) = 


Substituting this back in equation (9) and neglecting 0(m~-*), we obtain 
a second approximation, by methods similar to those for case I, 


Numi, 9m? z( ke 1 7 ? 
het . —(E-1)}1+—(2+—-=-= 2) |. 
Faw 2? ( ) T om we E FE sik alll ut) 


To solve this, we notice that if 





9 
Z 


8m?\" . 
Nim = ( ~) (E — 1)"eX* ™(z), then 
| 1 kz 1 1 1 
—_ = —] a —{]1 seoteiees SSM nasi nate Tease > cla, Saran a) 
Ke, m+1— Kem +et( ve tS RE ma) + 0m ) 


Neglecting 0(m~*), the solution of this is 


1 
Kyy = mE ~ m+ kin (E =) Sine + ¥ (b2) 
‘ 2m 2 


“ 


Since Nim —>1, it turns out that e” = 1, so that the first approximation 


m—> © 










for N is 


2 


8m?\” m z \* —1/2,m(E—1) 
Ne, m (z) = ey (E = 1) E- ‘in E é . (12) 


In the same manner, by neglecting only 0(m~*), the second approximation 
equals the above solution multiplied by the factor 


1 12k? + 3 6kz/m 5 
seuisitits —_— 1952 : ams ee 
exp E- (2 12k? + + - ;) | 


Since we have assumed that $E > O(m^{-1})$, and have used $m = \infty$
as a boundary point, the above solution is perhaps valid only for | m |





z 
The above formulas give for the Bessel function 
n?\" (1 — u)” 


Jn (x) = (=) Tae | mtu —1)+ a (2 +2-5)] (13) 


2 


where uw? = 1 — - With this formula we compute Jo.(20) = 0.0198, 





where Hayashi’s tables give 0.0199. 
Case III. $|z| \gg |m|, 1$.
By the methods of case I we obtain for a first approximation: 


Fy, m(a) = (2k)*e7* E (1 — 2)" (- =.)" exp 
Ee sa 1:-*)| (14) 


1 &b ee 
ax ei oe - = is ca 1 
4 *) > Oand | & || b/d =| 0% >. 





providedz>m,1,R ( 


For computation a second or third approximation might be necessary. 
These would contain m. 
I am indebted to Professor P. M. Morse for the interest he has shown in 


my work. 








534 MATHEMATICS: MICHAL AND ELCONIN Proc. N. A. S. 


COMPLETELY INTEGRABLE DIFFERENTIAL EQUATIONS IN
ABSTRACT SPACES¹


By A. D. MICHAL AND V. ELCONIN 
DEPARTMENT OF MATHEMATICS, CALIFORNIA INSTITUTE OF TECHNOLOGY 


Communicated July 26, 1935 


1. Introduction.—This note briefly summarizes some of our results in
the theory of the differential equation

$$d_\xi f(x) = F(x, f(x), \xi), \qquad (1)$$

where the left member is the Fréchet differential² of f(x) with increment
$\xi$, and the ranges and domains of the functions involved are in Banach
spaces.³ The rather intricate proofs, which involve most of the known
and many new properties of abstract differentials and integrals,⁴ are too
long to be given here; but they are all included in a comprehensive memoir
which we intend to publish elsewhere. A few indications are given of the
wide range of results obtainable either by specializing the Banach spaces, or
by applying the general theory to other problems in abstract analysis.

2. Fundamental Existence Theorems.—In this section $E$, $\Sigma$ are Banach
spaces; $x, u, \xi, z$ are in $E$; $y, v, \eta$ are in $\Sigma$; $a, b, c, g$ are positive real numbers;
$m$ is a non-negative integer; and $\varphi(x)$ is any function on $E$ to $\Sigma$.

THEOREM I. For any $x, y, z, \xi, \eta$ such that $\|x - u\| < a$ and $\|y - v\| < b$,
let $F(x, y, z)$ be in $\Sigma$, linear⁵ in $z$, and such that

$$G(x, y, z, \xi, \eta) = d^{\,\xi,\eta}_{x,y} F(x, y, z)$$

exists, continuous in $x, y$; and for $\|x - u\| < a$ and any $m$, let $f_0(x)$ be $v$,
and let $f_{m+1}(x)$ be

$$v + \int_0^1 F\big(u + \sigma(x - u),\ f_m(u + \sigma(x - u)),\ x - u\big)\,d\sigma.$$

Then

(1) For some $c < a$ and any $x$ such that $\|x - u\| < c$, $f_m(x)$ exists and
$\|f_m(x) - v\| < b$.

(2) For any such $c$: if

$$\|G(x, y, z, 0, \eta)\| < g\,\|z\|\,\|\eta\|$$

for some $g$ and any $x, y, z, \eta$ such that $\|x - u\| < c$ and $\|y - v\| < b$, then

(i) $f(x) = \lim_{m \to \infty} f_m(x)$ exists for $\|x - u\| < c$;

(ii) $$G(x, y, z, \xi, F(x, y, \xi)) = G(x, y, \xi, z, F(x, y, z)) \qquad (2)$$

for any $x, y, z, \xi$ such that $\|x - u\| < c$ and $\|y - v\| < b$ implies⁶ $f(u) = v$,
$\|f(x) - v\| \le b$, $d_\xi f(x) = F(x, f, \xi)$, $d_\xi d_z f(x) = G(x, f, \xi, z, F(x, f, z))$
for any $x, \xi, z$ such that $\|x - u\| < c$;

(iii) if $\varphi(u) = v$, $\|\varphi(x) - v\| \le b$, $d_\xi \varphi(x) = F(x, \varphi, \xi)$,
for any $x, \xi$ such that $\|x - u\| < c$, then for $\|x - u\| < c$, $\varphi(x) = f(x)$.


This theorem shows the existence, under certain hypotheses on F(x, y, z),
of a unique solution f(x) of equation (1) after u and v have been fixed;
but it can be shown that under similar hypotheses a unique solution
f(x, v, u) exists for all u, v in certain domains of E and Σ, respectively.
This last theorem, to which we shall refer as Theorem II, enables us to
study the dependence of the solution on the initial parameters u, v. We
have found that f(x, v, u) is continuous in v and that under certain additional
hypotheses on F(x, y, z), the Fréchet differentials of f(x, v, u) with
respect to u and v exist. However, it appears from our latest work that
these additional hypotheses can be weakened considerably.
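
The successive approximations of Theorem I can be made concrete in the simplest special case. The sketch below assumes, purely for illustration, that both Banach spaces are the real numbers, that F(x, y, z) = yz, and that u = 0, v = 1; equation (1) then reduces to f′ = f with f(0) = 1. With polynomial iterates the integral over σ can be done exactly, so no quadrature is needed. The function names are hypothetical.

```python
from fractions import Fraction


def picard_step(coeffs):
    """One approximation step for F(x, y, z) = y*z with u = 0, v = 1.

    If f_m(x) = sum_j c_j x**j, then
    f_{m+1}(x) = 1 + integral_0^1 f_m(s*x) * x ds = 1 + sum_j c_j x**(j+1) / (j+1).
    """
    return [Fraction(1)] + [c / (j + 1) for j, c in enumerate(coeffs)]


def picard_iterates(steps):
    """Coefficients of f_steps(x), starting from f_0(x) = v = 1."""
    coeffs = [Fraction(1)]
    for _ in range(steps):
        coeffs = picard_step(coeffs)
    return coeffs


def evaluate(coeffs, x):
    """Evaluate the polynomial with the given coefficients at x."""
    return sum(float(c) * x**j for j, c in enumerate(coeffs))


if __name__ == "__main__":
    print(picard_iterates(4))                # partial sums of the exponential series
    print(evaluate(picard_iterates(20), 1.0))  # close to e
```

In this special case the iterates are the Taylor partial sums of the exponential function, which is the unique solution asserted by part (iii) of the theorem.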

3. Specializations and Applications.—Many known results may be
obtained from the theorems of the last section by properly specializing
$E$ or $\Sigma$. Thus if $E$ is the real number system, then equation (1) is equivalent
to the "ordinary" differential equation

$$\frac{df(x)}{dx} = \varphi(x, f(x)),$$

where $\varphi(x, y)z = F(x, y, z)$, the integrability condition is automatically
satisfied, and Theorem I includes a theorem of Kerner.⁴ Or, if $E$ is the
space of all ordered sets $(x_1, \ldots, x_m)$ of $m$ real numbers, and $\Sigma$ is the space
of all ordered sets $(y_1, \ldots, y_n)$ of $n$ real numbers, then equation (1) is
equivalent to the system of partial differential equations

$$\frac{\partial f_i}{\partial x_k} = \varphi_{ik}(x_1, \ldots, x_m, f_1, \ldots, f_n) \qquad (i = 1, \ldots, n;\ k = 1, \ldots, m)$$

on putting $F(x, y, z)$ equal to

$$\Big(\sum_{k=1}^{m} \varphi_{1k}(x_1, \ldots, x_m, y_1, \ldots, y_n)\,z_k,\ \ldots,\ \sum_{k=1}^{m} \varphi_{nk}(x_1, \ldots, x_m, y_1, \ldots, y_n)\,z_k\Big),$$

and Theorems I and II include theorems of Nikliborc⁷ and Nikodym,⁸
respectively.

New results may also be obtained in this way. For example, if $E$ is
the space of real, continuous functions $x(s)$ on a real interval $(\alpha, \beta)$, and
$\Sigma$ is the real number system, then the functional equation⁹

$$d_\xi f[x(s)] = \int_\alpha^\beta \varphi\big[x(s) \,\big|\, f[x(s)],\ t\big]\,\xi(t)\,dt \qquad (3)$$

is an instance of equation (1), on putting

$$F(x, y, \xi) = \int_\alpha^\beta \varphi\big[x(s) \,\big|\, y,\ t\big]\,\xi(t)\,dt;$$


and Theorems I and II become the strongest existence theorems known to 
us. In fact, they furnish existence theorems for functional equations 
much more complicated than equation (3). 

The theory of the differential equation (1) yields an existence theorem,
under certain hypotheses on K(x, y), for the "implicit" function I(x)
satisfying

K(x, I(x)) = 0

where the ranges and domains of I(x) and K(x, y) are in Banach spaces.
In making this application, we found it necessary to show that the inverse
of a solvable¹⁰ linear function Fréchet differentiable with respect to a
parameter has itself a Fréchet differential with respect to that parameter;
a result of considerable intrinsic interest.

Another application of the theory is to the second order differential 
equation 


$$d_\xi d_\eta f(x) = H(x, f(x), d_\eta f(x), \xi, \eta).$$


Finally, we emphasize again that only a few of the problems in which the 
theory of equation (1) is useful have been indicated here. More complete 
indications will be given in other papers. 


1 Presented to the Am. Math. Soc. at the 1934 Berkeley meeting and at the 1934
Los Angeles meeting. Cf. Bull. Am. Math. Soc., 40, 530 and 814 (1934).

2 Fréchet, M., Annales Sc. Ec. Norm. Sup., t. 42, 293-323 (1925). By $d^{\,\xi_1, \ldots, \xi_n}_{x_1, \ldots, x_n} f(x_1, \ldots, x_n)$
is meant the Fréchet differential of $f(x_1, \ldots, x_n)$ with increments $\xi_1, \ldots, \xi_n$.

3 Banach, S., Fund. Math., 3, 133-181 (1922). See also his book, Théorie des
Opérations Linéaires (1932). A Banach space is briefly a complete normed vector space
with real number multipliers. We shall assume that if α = β, then α may be replaced
by β in any proposition.

4 Kerner, M., Prace Matematyczno-Fizyczne, 40, 47-67 (1932). By $\int_0^1 f(\sigma)\,d\sigma$ is meant
the "Riemann integral" of the abstract function f(σ) of a real variable σ.

5 A function is linear if it is additive and continuous.

6 Identity (2) is the condition of complete integrability for equation (1).

7 Nikliborc, W., Studia Math., 1, 41-49 (1929).

8 Nikodym, O., Bull. Internat. de l'Académie Polonaise Sci. Lett., No. 4-8, 134-144
(1933).

9 For the functional notation see Volterra, V., Fonctions de Lignes, Paris, 1913.

10 Hildebrandt, T. H., and Graves, L., Trans. Amer. Math. Soc., 29, 145 (1927). These
authors use the phrase "linear function with a reciprocal" instead of "solvable linear
function."











VoL. 21, 1935 PHYSICS: C. H. FAY 537 


A REFINEMENT OF THE HEISENBERG THEORY OF FERRO- 
MAGNETISM, APPLICABLE TO SIMPLE CUBIC CRYSTALS 


By CHARLES H. Fay 
RESEARCH LABORATORY OF PHysics, HARVARD UNIVERSITY 


Communicated June 17, 1935 


Introduction.—The Heisenberg theory of ferromagnetism,¹ in what we
shall term its "simple" form,² involves the replacement of the energy levels
of the spin multiplets of the micro-crystal for a given total spin quantum
number S′ by the average energy for S′. As a supposedly better approximation
Heisenberg has used the assumption of a Gaussian distribution of
energies for S′; however, this approximation is in some respects worse
than the simple approximation, for it has the effect of introducing energies
lower than that of the state of maximum S′ for states of small spin. Hence,
even though ferromagnetism may be obtained in a certain temperature
range, the crystal becomes paramagnetic at very low temperatures, instead
of approaching its true saturation magnetization as it should. The work
of the present paper was undertaken largely in order to get a better idea
of the validity of the simple approximation described above.

Method.—The method of the present paper consists essentially of re- 
garding the micro-crystal as built up of units larger than the atom but con- 
taining sufficiently few atoms to allow calculation of their unperturbed 
energy levels; the coupling energies between these units for states of spin 
S’ are replaced by the average coupling energy for S’. The method is most 
readily applicable to simple cubic crystals, which may be regarded as built 
up of unit cubes each containing eight atoms. In restricting ourselves to 
simple cubic crystals we are unfortunately giving up the chance of compar- 
ing our results with those of experiment, as simple cubic arrangement of 
atoms considered ferromagnetic is found only in certain little-studied alloys. 
Our interest, however, lies in the comparison of the results of this method 
with those of the simple Heisenberg theory. It will be seen that our 
method treats half of the interaction rigorously (instead of none as previ- 
ously), handling the remainder by an average energy approximation similar 
to that of the simple theory. This retention of some of the fine structure 
of the problem will give us a better idea of the results to be expected were 
a rigorous treatment possible. 

The energy levels of the problem of eight electrons in similar orbits at
the corners of a cube have been worked out by Seitz and Sherman³ and
Serber.⁴ In order to be able to use their results we further restrict ourselves
to the case of atomic spin 1/2.

It is assumed as usual that only the exchange integral between orbits of
neighboring atoms is appreciable. Now of the six neighbors of a particular
atom three are in its unit cube, so that half of the atomic coupling is
taken care of in the cube levels. The average intercube energy for total
spin S′ is thus half the average interatomic energy for S′, being
$(1/2)(-2a\sum \mathbf{s}_i\cdot\mathbf{s}_j)$, apart from an additive term independent of spin orientation.⁵
The summation extends over all pairs of neighboring atoms. Here a is
the Heisenberg exchange integral between valence orbits of neighboring
atoms; $\mathbf{s}_i$ is the vector spin matrix of atom i. We shall here use n for the
number of cubes, so that the number of atoms is 8n. Now

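$$\Big(\sum_i \mathbf{s}_i\Big)^2 = \sum_i \mathbf{s}_i^2 + 2\sum_{i<j} \mathbf{s}_i\cdot\mathbf{s}_j = S'(S' + 1),$$

whence

$$\overline{\mathbf{s}_i\cdot\mathbf{s}_j} = \frac{S'(S' + 1)}{8n(8n - 1)} - \frac{3}{4(8n - 1)},$$

so that to significant terms

$$\sum \mathbf{s}_i\cdot\mathbf{s}_j = 3S'^2/8n. \qquad (1)$$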
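
The averaging step can be checked with exact arithmetic. The sketch below assumes atomic spin 1/2, so that each $\mathbf{s}_i^2 = 3/4$, and 8n atoms with three neighbor pairs per atom (24n neighbor pairs in all); the function names are illustrative. For the fully aligned state, S′ = 4n, every pair must have $\mathbf{s}_i\cdot\mathbf{s}_j = 1/4$, which gives a simple test of the formula.

```python
from fractions import Fraction


def mean_pair_product(n_cubes, total_spin):
    """Average of s_i . s_j over all pairs of the 8n atoms, for total spin S'."""
    atoms = 8 * n_cubes
    s = Fraction(total_spin)
    return s * (s + 1) / (atoms * (atoms - 1)) - Fraction(3, 4 * (atoms - 1))


def neighbor_sum_leading(n_cubes, total_spin):
    """Leading spin-dependent part of the sum over neighbor pairs, 3 S'^2 / 8n."""
    return 3 * Fraction(total_spin) ** 2 / (8 * n_cubes)


if __name__ == "__main__":
    n = 5
    print(mean_pair_product(n, 4 * n))      # 1/4 for the fully aligned state
    print(neighbor_sum_leading(n, 4 * n))   # 24n pairs times 1/4, i.e. 6n
```

For the aligned state the leading term happens to reproduce the exact neighbor sum; for smaller S′ the terms dropped are either independent of S′ or smaller by a factor of order 1/n.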


The summation in (1) is over neighboring atoms. To our approximation
the portion of the energy depending on the spin is thus

$$W = \sum_u W_u - 3aS'^2/8n, \qquad (2)$$

where $W_u$ is the unperturbed energy of cube u.
The partition function for the system in a magnetic field H along the z
axis is from (2)

$$Z = \sum \Big[\prod_u \exp(-W_u/kT)\Big] \exp(3aS'^2/8nkT - 2\beta H S_z/kT), \qquad (3)$$

the summation being over the states of the micro-crystal. Here β is the
value of the Bohr magneton. The coefficient of H arises from the usual
assumption that only the spin angular momentum need be considered in
determining the magnetic moment.

In order to sum (3) we make use of the assumption, true of the other
theories, that the important terms in Z lie in the neighborhood of $S' = -S_z = -\bar{S}_z$,
so that we may replace $S'^2$ by $-\bar{S}_z^2 + 2\bar{S}_z S_z$ in summing (3).
This procedure gives, since $S_z = \sum_u S_z^{(u)}$,

$$Z = \exp(-3a\bar{S}_z^2/8nkT)\Big[\sum \exp\big(-W_u/kT + (3a\bar{S}_z/4nkT - 2\beta H/kT)\,S_z^{(u)}\big)\Big]^n, \qquad (4)$$

the summation in (4) being over the states of the cube. It is convenient
to introduce the notation

$$\zeta = -\bar{S}_z/4n, \qquad \gamma = 3a/kT, \qquad y = \gamma\zeta + 2\beta H/kT, \qquad (5)$$














ζ being the ratio of the magnetic moment to its true saturation value at T
= 0, and

$$\{x\} = \sum\nolimits_x \exp(-W(\text{cube})/kT). \qquad (6)$$

$\sum_x$ denotes a summation over cube states of spin x, not including the spatial
degeneracy factor. Since the cube levels are proportional to a, the quantities
(6) are functions of γ only. Equation (4) may now be written

$$Z = \exp(-2n\gamma\zeta^2)\big[2\{4\}\cosh 4y + 2(\{4\} + \{3\})\cosh 3y + 2(\{4\} + \{3\} + \{2\})\cosh 2y + 2(\{4\} + \{3\} + \{2\} + \{1\})\cosh y + (\{4\} + \{3\} + \{2\} + \{1\} + \{0\})\big]^n. \qquad (7)$$

From the assumption made in deriving (4) it follows that

$$\zeta = (1/4n)\,\partial \ln Z/\partial y. \qquad (8)$$

From (7) and (8) we have finally

$$\zeta = \frac{8\{4\}\sinh 4y + 6(\{4\} + \{3\})\sinh 3y + 4(\{4\} + \{3\} + \{2\})\sinh 2y + 2(\{4\} + \{3\} + \{2\} + \{1\})\sinh y}{4\big[2\{4\}\cosh 4y + 2(\{4\} + \{3\})\cosh 3y + \cdots + (\{4\} + \{3\} + \cdots + \{0\})\big]}, \qquad (9)$$

which has besides the trivial root zero for H = 0 a real positive root for sufficiently
large γ. The ferromagnetic Curie temperature, defined as the
temperature at which the pseudo-saturation magnetism vanishes, is thus
determined by the value of γ for which (9) has the double root zero. This
condition gives

$$\gamma\varphi(\gamma) = 1, \qquad (10)$$

where

$$\varphi(\gamma) = (30\{4\} + 14\{3\} + 5\{2\} + \{1\})/(18\{4\} + 14\{3\} + 10\{2\} + 6\{1\} + 2\{0\}). \qquad (11)$$
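
Two simple numerical checks on the function in (11) are possible without the cube energy levels. The sketch below assumes that at very high temperature every Boltzmann factor tends to 1, so that each bracket {x} reduces to the number of multiplets of spin x formed by eight spins of 1/2; it also reads the first coefficient of the denominator of (11) as 18, which is what expanding (9) to first order in y gives. The function names are illustrative and the cube levels of Seitz, Sherman and Serber are not reproduced here.

```python
from math import comb


def multiplet_count(total_spin, n_spins=8):
    """Number of multiplets of the given total spin for n_spins spins of 1/2."""
    k = n_spins // 2 - total_spin
    lower = comb(n_spins, k - 1) if k >= 1 else 0
    return comb(n_spins, k) - lower


def phi(weights):
    """The ratio in equation (11); weights[s] stands for the bracket {s}."""
    num = 30 * weights[4] + 14 * weights[3] + 5 * weights[2] + weights[1]
    den = (18 * weights[4] + 14 * weights[3] + 10 * weights[2]
           + 6 * weights[1] + 2 * weights[0])
    return num / den


if __name__ == "__main__":
    counts = {s: multiplet_count(s) for s in range(5)}
    print(counts)                  # multiplet counts for spins 0..4
    print(phi(counts))             # high-temperature limit of the ratio
    print(round(1 / 1.1795, 4))    # Curie temperature relative to the simple theory
```

The limiting value 1/2 makes the small-field susceptibility reduce to the ordinary Curie law at high temperature, and the last line reproduces the factor 0.8478 quoted in the text from the numerical root 1.1795.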


Results below Curie Temperature.—Equation (10) has been solved numerically
with the result $\gamma_c = 1.1795$, whereas the "simple" Heisenberg theory
gives the value $\gamma_c = 1$. The Curie temperature as given by our method is
thus 0.8478 times that of the simple Heisenberg theory. That the results
are so little different is of considerable interest in view of the fact that
the Gaussian distribution approximation does not admit ferromagnetism
for simple cubic crystals.

The pseudo-saturation magnetization curve has been obtained by numerical
solution of (9). Figure 1 gives this curve, plotted against the reduced
temperature $T/T_c$. For purposes of comparison the pseudo-saturation
magnetization curve given by the simple theory is also plotted against
the appropriate reduced temperature. It is seen that on this basis the
curves differ appreciably only at fairly low temperatures; the success of
applications of the theory using the experimental value of the Curie temperature
to determine the value of the exchange integral is thus seen to afford
no very decisive check on the validity of the approximations used.
At very low temperatures (9) becomes approximately

$$\zeta = 1 - (1/4)\exp(-3a/kT), \qquad (12)$$

while the simple theory gives

$$\zeta = 1 - 2\exp(-6a/kT). \qquad (13)$$






































FIGURE 1

Pseudo-saturation magnetization curves. The dashed curve indicates
the result of the simple Heisenberg theory.


A theory for extremely low temperatures due to Bloch⁶ gives

$$\zeta = 1 - 0.0587\,(kT/a)^{3/2}. \qquad (14)$$

While our result retains the exponential character of (13) its trend as
compared to (13) is toward the slower approach to saturation given by
(14).

Behavior Above Curie Point.—For temperatures above the Curie point the
susceptibility for small fields may be obtained from (9) by retaining only
first order terms in y. The result is

$$\chi = 2N\beta^2\varphi(\gamma)/k\big(T - \gamma_c\varphi(\gamma)T_c\big). \qquad (15)$$

In figure 2 (15) is compared with the result of the simple Heisenberg theory,

$$\chi = N\beta^2/k(T - T_c), \qquad (16)$$


the reciprocal susceptibilities being plotted, as is customary. Experimen- 
tally it is found that the susceptibility is given to considerable accuracy 


FIGURE 2

Reciprocal susceptibility curves above the Curie point. Dashed
line, result of simple theory; solid curve, our result; dotted line,
its asymptote.


by a modified Curie law like (16), with, however, a $T_c$ different from the
ferromagnetic Curie point, defined as the second or paramagnetic Curie
point. Now (15) gives a curve which approaches rather gradually an
asymptote with intercept at $T = 1.14T_c$, some 90° above the ferromagnetic
$T_c$ for $T_c$ of the order of 640°K. (nickel). This value is hardly to be compared
with experimental values for the difference between the Curie points,
for the temperature range covered by experiment does not much exceed
$2T_c$. A tangent to our curve in the experimental range would give a value








542 PHYSICS: HUTCHISSON, OSGOOD AND FEARON Proc. N. A. S. 


of the order of 10° for the difference between the two Curie points, supposing
that the second Curie point could be qualitatively explained on
this sort of basis. The experimental value for the common magnetic
materials is about 20°. However, a paper by Forrer⁷ seems to show that
hysteresis is required to account for the existence of two Curie points.

Conclusion.—The fact that our method, which takes account of one-half
the interaction rigorously, gives results comparing closely with those
of the simple Heisenberg theory, but differing widely from the result of the
use of the Gaussian distribution, seems to indicate that the simple approximation
is better than the Gaussian one. The order of validity of the simple
approximation is perhaps as good as that of the other assumptions of the
Heisenberg theory, except at low temperatures.

In conclusion the author wishes to thank Professor J. H. Van Vleck for 
suggesting the problem and for many helpful discussions. 


1 Heisenberg, W., Zeits. Physik, 49, 619-636 (1928).

2 Van Vleck, J. H., Electric and Magnetic Susceptibilities, pp. 316-360. See this reference
for exposition of the method of the vector model which we use.

3 Seitz, F., and Sherman, A., Jour. Chem. Phys., 2, 11-19 (1934).

4 Serber, R., Ibid., 2, 697-710 (1934).

5 Van Vleck, J. H., Ibid., p. 324.

6 Bloch, F., Zeits. Physik, 61, 206-219 (1930).

7 Forrer, R., Jour. Physique, 1, 49-64 (1930).


ELECTRICAL CONDUCTANCE OF SHORT GAPS IN AIR 


By E. HUTCHISSON, T. H. OSGOOD* AND R. E. FEARON


UNIVERSITY OF PITTSBURGH, PITTSBURGH, PA. 
Communicated August 12, 1935 


The transition between an open circuit and electrical contact is generally 
considered to be abrupt. Yet it seems reasonable that if a gap could be 
closed gradually enough, there would be a continuous change from open 
circuit to good contact. Conduction through short gaps usually has been
investigated¹ for cases in which the width of the gap is of the order of several
wave-lengths of visible light. Very great potential differences are
needed to produce measurable currents under such conditions, making it 
essential to use high vacua for the experiments, as otherwise gaseous dis- 
charges occur. It is the purpose of this work to study the dependence of 
conductance upon gap width under ordinary laboratory conditions (such 
as obtain for switch contacts in air). To prevent sparking at high field 
intensities, potential differences are used which are below the ionizing poten- 
tial of the common gases of air. The use of such small potentials makes it 
















essential to use exceptionally short gaps if the currents obtained are to be 
capable of measurement, even with modern direct current amplifiers. 

In this research the gap considered is effectively that between a sphere
and a plane. Assuming that the conductance falls off very rapidly as the
gap width increases it can be shown by calculation that the conductances 
of very short gaps of this type of equal minimum width are proportional to 





FIGURE 1
Apparatus used to bring two nearly flat surfaces into gradual contact.
(Labels in the drawing: steel mirrors, mica ring, light path, copper bellows,
steel spring, scale to measure extension of spring.)


the radii of curvature of the spherical surfaces. Three experimental arrangements
were used in which the radii of curvature of the spherical electrodes
were approximately 14 × 10² cm., 14 cm. and 25 × 10⁻⁴ cm. Although
the surfaces used in the present work were not rigorously dust-free,
all ordinary precautions to remove dust were taken, and it is felt that
the results represent what may be expected to occur when ostensibly clean 










metallic surfaces are brought into contact under everyday laboratory condi- 
tions. 

One of the experimental arrangements consisted of a gap between a
circular stainless steel mirror (plane to within 1/10 wave-length of sodium
light and 6 mm. thick) and a thin stainless steel mirror (polished plane
to within one wave-length and 2 mm. thick); the latter being loaded at
the center. The thin disc was supported above the other by a very thin 
ring of mica as shown in figure 1. The mirrors were 4.0 cm. in diameter and 
the hole in the mica ring was approximately 3.2 cm. in diameter. The load 
on the upper disc was applied by means of a stretched spring to avoid the 


FIGURE 2
Conductance of an air gap between a plane and a sphere of 1430 cm. radius,
plotted against the separation at the closest point in Å. Since
the conductance varies somewhat with the applied potential difference, the latter is
indicated at the right of each point. The curve is drawn approximately through the
points representing the conductance at the smallest potential difference which would
yield accurate measurements.


seismic effects of a direct load. Provision was made for covering the ap- 
paratus with a bell jar and for evacuating. 

The bending of the upper disc was calibrated by observing interference 
fringes produced when the lower steel disc was replaced by a heavy piece 
of optical glass. 

It was found that with a separating mica ring thin enough to show inter- 
ference colors, the two plates acted as though they were in contact. With 
larger thicknesses of mica, contact occurred only when the upper disc 
was loaded. As contact was approached by loading, the conductance of 
the gap gradually increased. A fairly typical graph of the variation of 
conductance with distance of minimum separation is shown in figure 2. 
















The separation is considered to be zero when the conductance of the gap 
no longer changes appreciably with increasing load. Since conductance 
varies somewhat with applied potential difference, the potential difference 
for each point is recorded. A curve is drawn through points for which 
the applied potential difference was just sufficient to allow current measure- 
ments to be made. In its essential features, such a curve could be dupli- 
cated at will. The exact location of points changed somewhat upon pro- 
longed application of the potential difference or even upon standing. Very 
much the same conductance values were obtained when a bell jar was placed 
over the plates and a vacuum of approximately 0.01 mm. of mercury main- 
tained for several days. 

Tests for the validity of Ohm's law were made at various separations.
For large separations the current is approximately proportional to the
square of the voltage. At about 8000 Å, where the change of slope of the
curve in figure 2 occurs, there is a region in which the current is nearly proportional
to the three-halves power of the voltage. Below approximately
2000 Å, Ohm's law holds to within the accuracy of the measurements (about
5%).
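
The three regimes described above (current varying as the square, the three-halves power, and the first power of the voltage) differ only in the exponent of a power law, which can be read off as a slope on logarithmic axes. The sketch below shows that calculation; the voltage and current values in it are synthetic numbers made up for the illustration, not measurements from this work, and the function name is hypothetical.

```python
import math


def local_exponent(v1, i1, v2, i2):
    """Exponent p in I proportional to V**p, from two (voltage, current) readings."""
    return math.log(i2 / i1) / math.log(v2 / v1)


if __name__ == "__main__":
    # Synthetic readings in arbitrary units, generated from I = 2 * V**p.
    print(local_exponent(1.0, 2.0, 4.0, 32.0))  # square-law behavior
    print(local_exponent(1.0, 2.0, 4.0, 16.0))  # three-halves power
    print(local_exponent(1.0, 2.0, 4.0, 8.0))   # Ohm's law
```

With real data the exponent would be estimated from several readings at a fixed separation, since the text notes that the accuracy of the measurements is only about 5%.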

An attempt was made after two years to repeat these measurements 
using the same steel mirrors. In the meantime several small scratches 
had developed. In general a curve of the same shape was obtained except 
that the points were very much more irregular. The change in minimum 
separation from the point at which the current just became perceptible to 
that of contact was only about one-fourth as large as in the earlier work. 

A second set of observations was made using an entirely different type 
of apparatus in which the radius of curvature of the spherical electrode was 
about 14cm. In this apparatus, by means of a compound lever a highly 
polished steel spherical electrode could be brought very gradually into con- 
tact with a stainless steel mirror (plane within a few wave-lengths). One 
division on the adjusting dial corresponded to a change in separation of 
135 A. The plane steel mirror was supported at three points and could be 
replaced by a plate of optical glass. The movement of the spherical elec- 
trode could therefore be calibrated by means of interference fringes. Elec- 
trical measurements were made in the same manner as in the loaded disc 
experiment and contact was assumed when further turning of the dial 
produced no further increase in current at a constant very small potential 
difference. In addition to the conductance measurements, it was possible 
to make a direct comparison of the position of optical contact and the posi- 
tion of electrical contact. These occur within 5000 Å of one another.
Leaving the dial of the lever system at the same setting, it was possible to 
lift the steel mirror from its three-point support and replace it so that a new 
point on the same plane mirror was used to form the gap. The surfaces 
were always wiped off very carefully with lens paper before running a con- 








546 PHYSICS: HUTCHISSON, OSGOOD AND FEARON Proc. N. A. S. 


duction experiment. The drawing of a narrow strip of lens paper between 
the surfaces when they are nearly in contact was tried, but it did not seem 
to have any effect on the results. 

FIGURE 3
[Plot not reproduced. Horizontal axis: separation at closest point in Å.]

Conductance measurements of an air gap between a plane
and a sphere of 14 cm. radius. The erratic series of points
(circles with dots) was obtained with a newly prepared surface.
Subsequent measurements with same surfaces yielded points
indicated by circles with crosses in them and by the filled-
in circles.

By replacing the steel plate by an optical flat at the close of a conduction
experiment, the region on the spherical electrode where contact occurred
could be located by the interference method. Thus a minute inspection
is possible of the region where the electric forces were the strongest. Such
inspection revealed that the surface was not the same after the experiment
as it was at the beginning of the experiment. In each instance there were
disturbed places, visible as darker areas in the interference pattern, approxi- 
mately two microns wide and four or five microns long. When a spot on the 
spherical electrode was thus used up, a fresh spot could be used in the next 
experiment by tilting the three-point support and thus changing the point 
where the spherical surface was tangent to the plane. 

Representative points showing the dependence of conductance upon gap 
width in this experiment are shown in figure 3. The long and extremely 
erratic series of points illustrates what was usually obtained when a newly 
prepared surface was used under scrupulously clean conditions. As the 
mirror was used, the repetition of the experiment gave successively more 
abruptly rising curves plotted between conductance and gap width. The 
more abrupt series of points were sometimes fairly reproducible, but the 
long, erratic ones never were. The change in the nature of the results 
as the mirrors were used may have been due to the production of sharp 
prominences on the surfaces by the destructive action of the electrical and 
mechanical factors which affected them. It is to be remembered in inter- 
preting these points that the gap width was measured from some arbitrarily 
chosen point where the electrical conduction became very good. Thus, it 
may be that the gap widths quoted in figure 3 represent, not the separation 
of the general level of the surfaces, but the distances between conducting 
projections on either or both surfaces. 

In the experiments using the lever, there were many abrupt changes in 
conductance as the separation was changed. This is to be ascribed to the 
breaking down of conducting bridges, possibly from mechanical causes and 
possibly from being burned up by the heating effect of currents that passed 
through them. 

The electrical properties of a gap consisting of two wires of equal di- 
ameter crossed at right angles are the same as those of a gap between a 
plane and a spherical electrode of the same radius as that of the wires.
Fine tungsten wires were mounted on the opposite sides of the slow motion 
end of the lever system described in the previous paragraphs. The wires 
were stretched between supports 0.1 cm. apart. An estimate of the forces 
exerted transversely to the wires by the largest fields used is 0.1 dyne. 
This force would be sufficient to cause a displacement of the wire of not 
more than 25 Ångström units, assuming it is under a tension of 100 gram
weights. The diameter of the wires used was 0.002 in. 

The small radius of curvature of the electrodes in this experiment reduced 
very greatly the area over which the surfaces were very close as they ap- 
proached contact. This change practically eliminated the averaging of the 
effect of foreign particles scattered over the surface. The result was that 
the system passed from an open circuit condition to electrical contact in a 
very abrupt manner. In fact, this change occurred in a distance shorter
than the least which could be measured on the lever system used in these
experiments. It is therefore impossible to present any conductance-gap 
width graph for this type of gap. It can be said with certainty, however, 
as a result of this experiment, that if there was any conductance through 
the short gaps which was not due to foreign particles or to bridging, such 
conductance per unit area did not exceed one reciprocal ohm per square 
centimeter for a gap as narrow as 2 × 10⁻⁶ cm. A conduction curve for
any gap narrower than this would be subject to serious criticism, as very 
little is known about the smoothness of any type of polished surfaces within 
such fine limits. 

As a result of these experiments the conclusion may be drawn that ap- 
preciable electrical current can pass between surfaces of approximately 4 
cm. diameter which are separated as much as 10,000 Å in air. The fact
that at such a distance the current is approximately proportional to the 
square of the applied potential difference leads one to believe that charged 
dust particles in the air are playing the same part that ions do in solutions. 
It is thought that the dust particles become charged at one electrode, are 
drawn over to the other and become discharged or charged oppositely and 
thus return to the first electrode to repeat the cycle. Such a process seems 
to take place even if all ordinary precautions are taken to exclude dust from 
the space between the surfaces or even if the electrodes are placed in a bell 
jar and the air pressure reduced to 0.01 mm. of mercury for several days. 
In the present experiments, it seems that when the nearly flat electrodes 
are brought closer together some of the current is carried by invisible promi- 
nences which touch each other. In this way one can account for the fact 
that gradually the relation between current and voltage changes from an 
ionic one to an ohmic one. At very small separations the current is defi- 
nitely ohmic and is probably caused by metallic bridging. 

The writers are happy to take this opportunity to express their apprecia- 
tion for a research grant for this work from the Bache Fund of the National 
Academy of Sciences. 


* Now at the University of Toledo. 

1 G. Hoffman, Zeit. Physik, 4, 363 (1921); James W. Broxon, Phys. Rev., 20, 476
(1922); F. Rother, Ann. Physik, 81, 317 (1926); F. C. Brown, Phys. Rev., 2, 314 (1913);
Robert J. Piersol, Phys. Rev., 31, 441 (1928); R. A. Millikan, C. F. Eyring and S. S.
Mackeown, Phys. Rev., 31, 900 (1928); T. E. Stern, B. S. Gossling and R. H. Fowler, 
Proc. Roy. Soc., A124, 699 (1929). 

2 Cf. preliminary reports, Phys. Rev., 40, 129 (1932); 47, 802 (1935). 














VoL. 21, 1935 ZOOLOGY: S. P. CARLSON 549 


THE COLOR CHANGES IN UCA PUGILATOR 


By SVEN PH. CARLSON
STOCKHOLM, SWEDEN* 


Communicated July 22, 1935 


During recent years it has been shown, especially through the investi-
gations of Koller and of Perkins, that there probably is an organ in the
eye-stalks of crustaceans which by incretory means controls the color
changes of the body. This organ is influenced by stimulations from the
eyes. The incretory substance is similar to a hormone and brings about
contraction of the chromatophores in Crangon and Palaemonetes. Kropp
and Perkins (1933) have found that this hormone occurs in the eye-stalks
of all crustaceans thus far investigated, some 16 species, even in those
which themselves show no color changes. Meyer (1931) showed
that this hormone is not only active in the chromatophores of crustaceans 
but also calls forth contraction of the color cells in fishes (Gobius and 
Pleuronectes). In 1933 Kropp and Perkins proved that an injection of 
extract from the eye-stalks of crustaceans into tadpoles of Rana clamitans 
brings about a strong expansion of the chromatophores in this species. 
Koller states (1928) that in Crangon he has succeeded in getting expansion 
of the chromatophores by injection of an extract from the rostral region. 
Perkins and Snook (1931) and Perkins and Kropp (1933), however, have 
not observed any effects at all from such a rostral extract. 

Hanström (1931-1934) has found two different organs in the eye-stalks
of crustaceans, the X-organ and the blood-gland, as he calls them. These 
are both innervated from the medulla terminalis and, to judge from their 
histological structure, they are secretory in function. 

In the course of experiments made together with Dr. Hanström at the
Marine Biological Laboratory, Woods Hole, during the summer of 1935 in
studying these secretions, a number of different crustaceans have been 
investigated for their color changes. During these researches I found 
that the fiddler crab, Uca pugilator, behaved in a manner quite different 
from all other crustaceans. 

For a long time it has been known that Palaemonetes and Crangon 
grow dark if their eye-stalks are cut off, and this they do either in light or 
in darkness, on a white background or on a black one. Uca pugilator 
on the contrary grows pale if the eye-stalks are cut off, which is in accordance
with the observations of Megušar (1912) in Uca pugnax. The dark chroma-
tophores are maximally contracted. This can be observed especially 
clearly in the legs, where the chromatophore changes from a large and 
richly branched form to a small and punctate one. The whole animal 
grows pale and will remain so independently of the degree of light and
the color of the background. If an extract is made from the eye-stalks
of Uca and injected into pale animals, without eyes and with maximally 
contracted chromatophores, they grow dark again the more rapidly the 
stronger the extract. An obvious effect is obtained even by an amount of 
extract corresponding to one-tenth of one eye-stalk. After 5 minutes 
one can see under the microscope how the chromatophore branches extend. 
Maximal expansion is obtained within 2 hours, and then the animals 
slowly grow pale and within 4 hours the chromatophores are maximally 
contracted again. 

Extracts from Palaemonetes and several other crustaceans work in the 
same way. The effects are different only according to the strength of the 
extracts and all crustaceans thus far investigated have given this reaction. 

Extract from the eye-stalks of Uca, when injected into Palaemonetes,
produces a strong contraction of the chromatophores, just as extract
from Palaemonetes does in Palaemonetes itself. Extracts of all other
crustaceans tested acted in the same way.

In all probability we have in Uca the same common crustacean hormone.
Its effect on the dark chromatophores of Uca, however, is quite contrary
to that in all other crustaceans and fishes investigated, and is exactly
the same as the effect the crustacean hormone has on frogs.

In order to find out the position of the active organ, extracts were made 
from different parts of the eye-stalks. The eye-stalk of Uca is especially 
suited to such experiments on account of its length, which makes it possible 
to cut it in several suitably small pieces. As test objects eyeless Ucas 
were first used, but later eyeless and hence quite dark Palaemonetes proved 
to give a more rapid and sensitive reaction. 

The eye-stalks of Uca were cut transversely in three equal parts. Only
the middle part proved to be active, whereas the distal and proximal parts 
were quite inactive. 

In the same way such animals in which the distal parts of both eye- 
stalks had been cut off were found to keep their original dark color. If 
then the middle part was cut off, the animals grew pale as if the entire 
eye-stalks had been removed. 

From these experiments it seems that the active substance is derived 
from the middle part of the eye-stalks of Uca. By microscopical exami- 
nation I have not been able to discover the X-organ of Hanström in Uca
pugilator. On the contrary I have found a very fully developed blood- 
gland, innervated from the medulla terminalis and containing eosinophilic 
drops. It is situated on the surface of the medulla terminalis at its distal 
part and thus within the active part of the eye-stalks. I have not suc- 
ceeded in discovering any other innervated organs of secretion in this part 
of the eye-stalk. 

Thus my experiments on Uca prove that the active substance is derived 














VoL. 21, 1935 ZOOLOGY: J. M. OPPENHEIMER 551 


from the middle part of the eye-stalk and it seems probable that the 
blood-gland of Hanström produces it.

I have repeated these experiments on Uca pugnax and have obtained 
exactly the same results. 

A detailed account of this work with photographs and references will 
be published later, probably in Kungl. Fysiografiska Sällskapets
Förhandlingar, Lund, Sweden.


* Marine Biological Laboratory, Woods Hole, Mass. 


PROCESSES OF LOCALIZATION IN DEVELOPING FUNDULUS 
By JANE M. OPPENHEIMER 
OSBORN ZOOLOGICAL LABORATORY, YALE UNIVERSITY 
Communicated August 6, 1935


Localized vital staining has been used to study developing Fundulus, 
in order to ascertain, first, the early localization of embryonic organs in 
the cleaving egg, and second, the movements by which the cells forming 
these organs attain their final position in the embryo. The technique 
employed¹ varies slightly from that originated by Vogt.² The results
show that the long axis of the embryo does not necessarily coincide with the 
first or second plane of cleavage, and hence that any mosaic which exists 
in the cleaving egg is independent of the position of the planes of cleavage. 
Further, the presumptive forebrain material has in some cases been traced 
to one or two of the central cells of the sixteen-cell stage, and part of the 
germ-ring to the outer half of marginal cells of the same stage. Finally, 
gastrulation in Fundulus, while fundamentally of the same type as in the 
amphibian (Vogt³) and the trout (Pasteels⁴), is accompanied by accessory
processes not described for the other forms. 

At the completion of cleavage in Fundulus, the blastoderm is a homogene- 
ous cap of cells covering a small surface of the large uncleaved yolk. Ac- 
companying or preceding the onset of gastrulation, the cells of the blasto- 
derm concentrate at the periphery to form the germ-ring, leaving a central 
area of fewer cells. The central area is slightly eccentric, since the thick- 
ened rim of the blastoderm is wider than elsewhere at the region of the 
future embryonic axis. At this localized region invagination predominates, 
accompanied by a concentration of cells which forms the embryonic axis. 
This accumulation of cells is the embryonic shield, a plate of closely packed 
cells, which is sharply distinguished from the extra-embryonic membrane 
constituting the remainder of the blastoderm. 

The staining experiments show that the cells whose derivatives form most 
of the nervous system are localized early in the embryonic shield; they
take up their position principally by an antero-posterior stretching and
migration, only a small group of cells, indicated by horizontal hatching in 
figures 1A-1C, converging centrally. The early shield contains a central 
axis in which the cells are packed more closely than elsewhere; these are 
the cells which form most of the midbrain region (diagonal hatching in 
Fig. 1), and most of the hindbrain and part of the spinal cord (vertical 
hatching in Fig. 1). As these two groups of cells elongate during gastrula- 
tion, they receive contributions from the cells lying immediately lateral 
to the central axis of the early shield (horizontal hatching in Fig. 1). The 
material which later will form forebrain and eye (heavy stipple in Fig. 1)
is not part of the embryonic shield at the beginning of gastrulation, but is
situated in the extra-embryonic membrane anterior to the shield and joins
the shield anteriorly as gastrulation proceeds (Fig. 1B).

FIGURE 1
[Diagrams A, B and C not reproduced.]

The localization of presumptive nervous tissue, mesoderm and
endoderm in the early gastrula (A), and the middle gastrula (B),
and the positions of these tissues in the seven-somite embryo (C).

The position of the nervous tissue is indicated by heavy stipple
(forebrain and optic vesicles), diagonal hatching (midbrain and anterior
hindbrain) and vertical hatching (posterior hindbrain and spinal cord).
The cells whose position is indicated by horizontal hatching aid in the
formation of midbrain, hindbrain and spinal cord.

The mesoderm is represented by the lightly stippled areas. The
areas marked by the numbers 1, 2, 3 and 4 in A and B ultimately lie
in the regions of the embryo indicated by the arrows accompanying
diagram C.

The endoderm is represented by small circles.

The cells constituting prospective axial mesoderm lie laterally in the 
early shield, lateral to the shield in the germ-ring, and also in the extra-
embryonic membrane anterior to the shield and lateral to the presumptive 
forebrain and eye cells (light stipple in Fig. 1). The cells constituting 
the lateral wings of the early embryonic shield, situated between the pre- 
sumptive nervous system and the junction of the shield with the germ-ring 
(number 2 in Fig. 1) form the mesoderm in the posterior brain region and the
anterior cord region; this group of cells, as does all the presumptive meso-
derm, migrates toward the midline and also stretches antero-posteriorly. 
The cells in the germ-ring adjacent to the early shield (number 3 in Fig. 1) 
migrate axially and reach the shield when the yolk is one-half to two-thirds 
covered by the blastoderm. This group of cells forms a large part of the 
mesoderm posterior to the first few somites. Cells originally over 90° 
away from the embryonic axis (number 4 in Fig. 1) join the shield as the 
blastopore closes and aid in forming the tail-bud blastema. In mesoderm- 
formation, just as previously described for nervous system formation, cells 
originally found in the extra-embryonic membrane join the shield
anteriorly (number 1 in Figs. 1A-1C).

The addition of cells to the teleostean shield at its anterior part as well
as at its blastoporic lip was evidently suspected by Wilson,⁵ who described
the growth of the Serranus embryo by “intussusception” rather than by
concrescence. The occurrence of the phenomenon in Fundulus suggests
that induction may occur independently of actual invagination.

FIGURE 2
[Diagram not reproduced.]

Diagram of the tentative localization in the early gastrula of Fundulus.
Dorsal view. For explanation see text.

The endoderm is derived from cells at the posterior end of the shield,
which form the lip and migrate inward along the axis mesial to the meso-
derm. The notochord has not been positively located by the vital staining
method, but the contributing cells are probably situated between the areas 
for endoderm and nervous system. A tentative map of the early gastrula 
is shown in figure 2. In this diagram the vertical hatching represents the 
presumptive nervous system, horizontal hatching the mesoderm, heavy 
stipple the notochord and light stipple the endoderm. This map differs 
from Vogt’s for the amphibian and from Pasteels’ for the trout principally 
in the form of the presumptive nervous system area; in Fundulus this is 
a compact area which stretches antero-posteriorly rather than a crescentic 
area which converges or rotates into position. 


1 Oppenheimer, J. M., Science, in press.

2 Vogt, W., Arch. Entwmech., 106, 542-610 (1925).

3 Vogt, W., Ibid., 120, 384-706 (1929).

4 Pasteels, J., C. R. Assoc. des Anat., 29ième Réunion, Bruxelles, 451-458 (1934).

5 Wilson, H. V., Bull. U. S. Fish Commission, 9, 209-278 (1889, printed in 1891).








554 STATISTICS: T. L. KELLEY Proc. N. A. S. 


AN UNBIASED CORRELATION RATIO MEASURE 


By TRUMAN L. KELLEY
HARVARD UNIVERSITY 


Communicated July 16, 1935 


The properties of the correlation ratio have been very thoroughly 
studied and reported upon. It has long been a necessary instrument in 
the study of the nature of regression. The work of Fisher¹ in 1922 made
it a very precise instrument in studying the goodness of fit of second and 
higher degree regression lines. 

It, however, lacks a certain desirable simplicity of meaning in that its
value, $\eta$, obtained from a sample, differs from the population value, $\tilde\eta$,
not only in a random manner due to the fluctuation of the particular sample,
but also in a systematic manner which is a function of the number of arrays
in which the data are recorded. This systematic difference between
$\eta$ and $\tilde\eta$ is, of course, well known to the expert statistician, and allowed for
in his interpretations, as, for example, is automatically the case in the use
of the following formula by Fisher

$$\chi^2 = \frac{(N-k)(\eta^2 - R^2)}{1-\eta^2} \tag{1}$$

where N is the number of cases in the sample, k the number of arrays in
which the dependent variable is classed, $\eta$ the ordinary correlation ratio,
R the ratio of the standard deviation of the differences between the points
upon the regression line used and the means of the arrays of the sample to
the standard deviation of the dependent variable (for a linear regression
line, R is simply r, the ordinary product-moment correlation coefficient),
and $\chi^2$ is the ordinary $\chi^2$ distributed nearly in the Pearson type III manner
and with a number of degrees of freedom equal to [k − f(R)] in which
f(R) is the number of linear restrictions placed upon the frequencies in
determining the regression line employed, it equaling 2 in the case of
straight line regression, 3 for second degree parabolic regression, etc.
Entering a table giving probabilities for values of $\chi^2$ with the value
given by [1] and a number of degrees of freedom equal to [k − f(R)]
yields a value P which is the probability that if the true regression, or
regression in the population, is of the form assumed, a divergence from
it as great as that observed would arise as a matter of chance. Thus P,
derived for $\chi^2$, is an immediately interpretable statistic. We may note
the simplicity of several of the other concepts.


N = the number of cases in the sample

k = the number of arrays

k − f(R) = the number of degrees of freedom in the differences be-
tween the means of the arrays and the points on the re-
gression line

R = an unbiased estimate of the R, which we will designate $\tilde R$,
in the population.


However, $\eta$ is not an unbiased estimate of the correlation ratio, $\tilde\eta$, in the
population.

The intent of the present article is to provide a new correlation ratio
squared, $\epsilon^2$, which is an unbiased estimate of $\tilde\eta^2$; to provide its standard
error; and to substitute it in place of $\eta^2$ in Fisher’s formula [1]. So far
as P and $\chi^2$ are concerned, its use leads to identical results with [1].
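
As a worked illustration of formula [1] and its degrees of freedom, the following Python sketch may help. It is an assumption-laden aid and not part of the original paper: the function and argument names are invented here, and `n_restrictions` stands for f(R) (2 for a straight line, 3 for a second degree parabola).

```python
def fisher_chi_square(n_cases, n_arrays, eta_sq, r_sq, n_restrictions=2):
    """Chi-square of formula [1] and its degrees of freedom, k - f(R).

    n_cases        -- N, the number of cases in the sample
    n_arrays       -- k, the number of arrays
    eta_sq         -- the raw correlation ratio squared
    r_sq           -- R squared for the regression line being tested
    n_restrictions -- f(R): 2 for a straight line, 3 for a parabola, etc.
    """
    if not 0.0 <= eta_sq < 1.0:
        raise ValueError("eta_sq must lie in [0, 1)")
    chi_sq = (n_cases - n_arrays) * (eta_sq - r_sq) / (1.0 - eta_sq)
    degrees_of_freedom = n_arrays - n_restrictions
    return chi_sq, degrees_of_freedom

# Hypothetical sample: N = 100 cases in k = 10 arrays, linear regression.
chi_sq, dof = fisher_chi_square(100, 10, eta_sq=0.40, r_sq=0.30)
```

The probability P is then read from a table of $\chi^2$ (or computed by a statistics library) for the returned number of degrees of freedom.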

The need to allow for the systematic effect of fine grouping for small 
numbers in arrays was handled by Pearson² in 1923, but he did not use
Fisher’s results of 1922. In 1906 Blakeman³ provided a test for linearity
which now, upon the basis of the later work of Pearson and Fisher, we 
must believe to be inadequate. 

In the following treatment the notation will indicate population statistics 
by employing a tilde circumflex, ~; statistics which are means from a
large number of samples by the caret, ^; statistics which are sample
means by the macron, ¯; and other sample statistics by symbols having
none of these circumflexes. The difference between the ^ and the ~
statistics is entirely a matter of the size of the samples entering into the
average yielding the ^ statistics. With large samples every ^ statistic
approaches the corresponding ~ statistic. Subscripts a, b, c, ... k will
refer to the k successive classes of the independent variable. Also sub- 
script a will refer to any one such array. Letting v without subscript 
represent the variance of the dependent variable for the marginal total, 
and v with subscript the variance for an array, the true, or population, 
value of the correlation ratio is given by 


$$\tilde\eta^2 = 1 - \frac{S\,\tilde n_a \tilde v_a}{\tilde v\, S\,\tilde n_a} \tag{2}$$

wherein $\tilde n_a$ is the number giving the same proportion in array a as in the
population, and S indicates a summation for all arrays, a, b, ... k.
The usual or “raw” correlation ratio squared is given by the same formula
dropping the circumflex. This value, $\eta^2$, is subject to a fineness of grouping
error.

If the numbers in the arrays for a sample are $n_a, n_b, \dots n_k$, giving pro-
portions differing in a random manner from the proportions in the popu-
lation, which are $\tilde n_a, \tilde n_b, \dots \tilde n_k$, the function,

$$\frac{S\, n_a \tilde v_a}{S\, n_a}$$

will differ by chance only from

$$\frac{S\, \tilde n_a \tilde v_a}{S\, \tilde n_a}$$

Thus we may write the approximate equality

$$\tilde\eta^2 = 1 - \frac{S\, n_a \tilde v_a}{\tilde v\, S\, n_a} \tag{3}$$

This is an estimate of $\tilde\eta^2$ having no systematic error. We now desire
estimates of $\tilde v_a, \tilde v_b, \dots \tilde v_k$, and $\tilde v$ having no systematic errors. As shown
by “Student,”⁴ these are readily available. They are, respectively,

$$\frac{n_a v_a}{n_a - 1},\quad \frac{n_b v_b}{n_b - 1},\quad \dots,\quad \frac{n_k v_k}{n_k - 1},\quad \frac{N v}{N - 1}$$

wherein $N = S\, n_a$.
If we introduce these values in (3) we will obtain a formula for $\tilde\eta^2$ in
which the estimated true variances for the arrays are weighted according
to the number of cases in the sample in each array. Each array variance,
i.e., each $\frac{n_a v_a}{n_a - 1}$, is an estimate of the residual variance in the dependent
variable knowing the value (category) of the independent variable. How-
ever, the average of independent measures of the same thing having the
least standard error is given by weighting each inversely as its variance,
as early shown by Gauss.⁵ The variance of

$$\frac{n_a v_a}{n_a - 1} \tag{4}$$

equals $\left(\frac{n_a}{n_a - 1}\right)^2$ times the variance of $v_a$, which for any distribution is as
given by “Student”

$$v(v_a) = 2\tilde v_a^2\,\frac{n_a - 1}{n_a^2} \tag{5}$$

We accordingly obtain the variance of (4)

$$\frac{2\tilde v_a^2}{n_a - 1}$$

Introducing the reciprocals of these as weighting factors in place of $n_a$,
$n_b, \dots n_k$ in (3) yields

$$\epsilon^2 = 1 - \frac{S\left(\dfrac{n_a - 1}{2\tilde v_a^2}\right)\left(\dfrac{n_a v_a}{n_a - 1}\right)}{\dfrac{N v}{N - 1}\; S\,\dfrac{n_a - 1}{2\tilde v_a^2}} \tag{6}$$









It is entirely conceivable that $\tilde v_a \neq \tilde v_b \neq \tilde v_c$, etc., but since these are of
the nature of residual variances, it would not seem violent to assume them
equal, in which case (6) becomes

$$\epsilon^2 = 1 - \frac{(N - 1)\,S\, n_a v_a}{(N - k)\,N v} \tag{7}$$

Since $\eta^2 = 1 - \frac{S\, n_a v_a}{N v}$, we may write (7) as

$$\epsilon^2 = \frac{(N - 1)\eta^2 - (k - 1)}{N - k} = \eta^2 - \frac{(k - 1)(1 - \eta^2)}{N - k} \tag{8}$$

Comparing results here obtained with those of Pearson in 1923⁶ we find
certain small but significant differences.
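
Formulas (7) and (8) can be checked numerically. The following Python sketch is only an illustration under stated assumptions: the function name `correlation_ratios` and the small data set are hypothetical, and variances are taken with divisor n, as in the text, so that the sum of squares within array a is $n_a v_a$ and the total sum of squares is $Nv$.

```python
def correlation_ratios(arrays):
    """Return (eta_sq, eps_sq) for data grouped into arrays.

    arrays -- one list of dependent-variable values per array (class of
              the independent variable).
    eta_sq is the raw correlation ratio squared; eps_sq follows formula (7).
    """
    k = len(arrays)
    values = [x for array in arrays for x in array]
    n_total = len(values)
    if n_total <= k:
        raise ValueError("need more cases than arrays (N > k)")
    grand_mean = sum(values) / n_total
    total_ss = sum((x - grand_mean) ** 2 for x in values)      # N v
    if total_ss == 0:
        raise ValueError("dependent variable has zero variance")
    within_ss = 0.0                                            # S n_a v_a
    for array in arrays:
        array_mean = sum(array) / len(array)
        within_ss += sum((x - array_mean) ** 2 for x in array)
    eta_sq = 1.0 - within_ss / total_ss
    eps_sq = 1.0 - (n_total - 1) * within_ss / ((n_total - k) * total_ss)
    return eta_sq, eps_sq

# Hypothetical data: N = 9 cases in k = 3 arrays.
example = [[1, 2, 3], [2, 3, 4], [4, 5, 6]]
eta_sq, eps_sq = correlation_ratios(example)
```

For this data set the within-array sum of squares is 6 and the total is 20, so $\eta^2 = 0.7$ and $\epsilon^2 = 0.6$, in agreement with (8): $(8 \times 0.7 - 2)/6 = 0.6$.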


Pearson gives the chance $\eta^2$ value, in case true $\tilde\eta^2 = 0$, as $\frac{k - 1}{N}$, whereas
from (8) we obtain $\frac{k - 1}{N - 1}$, which value has also been obtained by Wishart.⁷
Pearson gives $\eta^2$ corrected for fineness of grouping, in case $\tilde\eta^2$ is fairly
large, as equal to $\frac{N\eta^2 - k + 3}{N - k + 3}$, which differs slightly from (8).

Let us now determine the standard error of $\epsilon^2$, given by (7).

$$v(\epsilon^2) = \left(\frac{N - 1}{N - k}\right)^2 v\!\left(\frac{S\, n_a v_a}{N v}\right) \tag{9}$$

$$N v = S\, x^2$$

in which x is the dependent variable as a deviation from the sample mean.
For array a each x may be written

$$x_a = \bar x_a + (x_a - \bar x_a)$$

where $\bar x_a$ is the mean for the array as a deviation from the sample mean,
and, as shown by “Student,” $\bar x_a$ is uncorrelated with $v(x_a - \bar x_a)$. We may
therefore write

$$S\, x^2 = S\, n_a \bar x_a^2 + S\, n_a v_a \tag{10}$$

Let

$$f = \frac{S\, n_a v_a}{S\, x^2} \tag{11}$$

Then

$$v(\epsilon^2) = \left(\frac{N - 1}{N - k}\right)^2 v(f) \tag{12}$$

And

$$v(\epsilon) = \frac{v(\epsilon^2)}{4\epsilon^2} = \frac{(N - 1)^2\, v(f)}{4(N - k)^2 \epsilon^2} \tag{13}$$











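$$f = \frac{(N - k)(1 - \epsilon^2)}{N - 1} = \frac{S\, n_a v_a}{S\, n_a v_a + S\, n_a \bar x_a^2} \tag{14}$$

$$\frac{df}{f} = \frac{d\,S\, n_a v_a}{S\, n_a v_a} - \frac{d\,S\, n_a v_a + d\,S\, n_a \bar x_a^2}{S\, n_a v_a + S\, n_a \bar x_a^2} \tag{15}$$

Employing the earlier assumption that $\tilde v_a = \tilde v_b = \dots = \tilde v_k$

$$\widehat{S\, n_a v_a} = S\,(n_a - 1)\tilde v_a = (N - k)\tilde v_a \tag{16}$$

$$\widehat{S\, x^2} = (N - 1)\tilde v \tag{17}$$

Taking logarithmic differentials of (14)

$$\frac{df}{f} = \frac{d\,S\, n_a v_a}{(N - k)\tilde v_a} - \frac{d\,N v}{(N - 1)\tilde v} \tag{18}$$

Squaring, summing and dividing by the number of samples

$$\frac{v(f)}{f^2} = \frac{v(S\, n_a v_a)}{(N - k)^2 \tilde v_a^2} + \frac{v(N v)}{(N - 1)^2 \tilde v^2} - \frac{2\,v(S\, n_a v_a)}{(N - k)(N - 1)\tilde v_a \tilde v} = \frac{2}{N - k} + \frac{2}{N - 1} - \frac{4(1 - \tilde\eta^2)}{N - 1} \tag{19}$$

From (12)

$$v(\epsilon^2) = \frac{(1 - \epsilon^2)^2 (N - k)^2 (N - 1)^2}{(N - 1)^2 (N - k)^2}\left\{\frac{2}{N - k} + \frac{2}{N - 1} - \frac{4(1 - \epsilon^2)}{N - 1}\right\}$$

$$v(\epsilon^2) = \frac{(1 - \epsilon^2)^2}{N - 1}\left\{\frac{2(k - 1)}{N - k} + 4\epsilon^2\right\} \tag{20}$$

$$\sigma_{\epsilon^2} = \frac{1 - \epsilon^2}{\sqrt{N - 1}}\left\{\frac{2(k - 1)}{N - k} + 4\epsilon^2\right\}^{1/2} \tag{21}$$

$$v(\epsilon) = \frac{v(\epsilon^2)}{4\epsilon^2} = \frac{(1 - \epsilon^2)^2}{4(N - 1)\epsilon^2}\left\{\frac{2(k - 1)}{N - k} + 4\epsilon^2\right\} \quad (\epsilon \text{ not small}) \tag{22}$$

$$\sigma_{\epsilon} = \frac{1 - \epsilon^2}{\sqrt{N - 1}}\left\{\frac{k - 1}{2(N - k)\epsilon^2} + 1\right\}^{1/2} \quad (\epsilon \text{ not small}) \tag{23}$$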


Formulas (22) and (23) are not usable formulas in case $\epsilon$ is small, as certain
higher order terms have been neglected. Formulas (20) and (21) are
generally useful except when $\frac{1}{N}$ is not small in comparison with $\frac{1}{\sqrt{N}}$.
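
The standard error of $\epsilon^2$ can be evaluated directly from N, k and $\epsilon^2$. The Python sketch below assumes the form $v(\epsilon^2) = \frac{(1-\epsilon^2)^2}{N-1}\{\frac{2(k-1)}{N-k} + 4\epsilon^2\}$ for formula (20), with (21) as its square root; the function name and the example figures are hypothetical.

```python
import math

def epsilon_sq_standard_error(n_cases, n_arrays, eps_sq):
    """Standard error of epsilon squared, as the square root of formula (20).

    n_cases  -- N, the number of cases in the sample
    n_arrays -- k, the number of arrays
    eps_sq   -- the unbiased correlation ratio squared (may be slightly negative)
    """
    if n_cases <= n_arrays:
        raise ValueError("need more cases than arrays (N > k)")
    bracket = 2.0 * (n_arrays - 1) / (n_cases - n_arrays) + 4.0 * eps_sq
    if bracket < 0.0:
        raise ValueError("eps_sq is too negative for the formula to apply")
    variance = (1.0 - eps_sq) ** 2 / (n_cases - 1) * bracket
    return math.sqrt(variance)

# Hypothetical case: N = 101, k = 11, epsilon squared = 0.5.
standard_error = epsilon_sq_standard_error(101, 11, 0.5)
```

Here the bracket is 20/90 + 2 and the variance is 0.25/100 times that, i.e. 1/180, so the standard error is about 0.0745.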


To obtain a test for goodness of fit we will make a substitution in (1).
In place of $\eta^2$ we will substitute $\epsilon^2$ as given by (8). This yields

$$\chi^2 = \frac{(N - k)(\epsilon^2 - R^2) + (k - 1)(1 - R^2)}{1 - \epsilon^2} \tag{24}$$

{k − f(R) degrees of freedom}
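
That the substitution leaves $\chi^2$ unchanged can be verified numerically. The Python sketch below assumes formula (1) in the form $\chi^2 = (N-k)(\eta^2 - R^2)/(1-\eta^2)$ and formula (24) in the form $\chi^2 = [(N-k)(\epsilon^2 - R^2) + (k-1)(1-R^2)]/(1-\epsilon^2)$; the function names and the example values are hypothetical.

```python
def epsilon_sq_from_eta_sq(n_cases, n_arrays, eta_sq):
    """Formula (8): the unbiased ratio from the raw correlation ratio squared."""
    return ((n_cases - 1) * eta_sq - (n_arrays - 1)) / (n_cases - n_arrays)

def chi_square_from_eta_sq(n_cases, n_arrays, eta_sq, r_sq):
    """Formula (1), written in terms of the raw eta squared."""
    return (n_cases - n_arrays) * (eta_sq - r_sq) / (1.0 - eta_sq)

def chi_square_from_epsilon_sq(n_cases, n_arrays, eps_sq, r_sq):
    """Formula (24), written in terms of epsilon squared."""
    numerator = ((n_cases - n_arrays) * (eps_sq - r_sq)
                 + (n_arrays - 1) * (1.0 - r_sq))
    return numerator / (1.0 - eps_sq)

# Hypothetical sample: N = 100, k = 10, eta^2 = 0.40, R^2 = 0.30.
eps_sq = epsilon_sq_from_eta_sq(100, 10, 0.40)
chi_sq_raw = chi_square_from_eta_sq(100, 10, 0.40, 0.30)
chi_sq_unbiased = chi_square_from_epsilon_sq(100, 10, eps_sq, 0.30)
```

With these figures $\epsilon^2 = 0.34$ and both expressions give $\chi^2 = 15$, which illustrates the earlier statement that, so far as P and $\chi^2$ are concerned, the use of $\epsilon^2$ leads to identical results with [1].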
A second substitution is called for in that $R^2$ will differ in a slightly
systematic manner from $\tilde R^2$. We have

$$R = \tilde R + \delta R$$

and

$$\widehat{R^2} = \tilde R^2 + v(R).$$

If we define $\rho^2$ by the equation

$$R^2 = \rho^2 + v(R)$$

the quantity ($\epsilon^2 - \rho^2$) and not ($\epsilon^2 - R^2$) is a quantity whose mean value
would equal zero in the case of correctness of the assumption as to the
nature of the regression. However, the difference between ($\epsilon^2 - \rho^2$) and
($\epsilon^2 - R^2$) will ordinarily be so small that no appreciable gain in inter-
pretability of the elements entering into (24) would result from incor-
porating $\rho^2$ instead of $R^2$ into the formula. As (24) stands, the magnitude
of ($\epsilon^2 - R^2$) is itself a good indication of goodness of fit. In particular,
if ($\epsilon^2 - R^2$) < 0, an excellent fit is indicated without calculating $\chi^2$ or P.
If ($\epsilon^2 - R^2$) is much less than zero, one should be skeptical of the arith-
metical accuracy of his computations or of the logical soundness of some
step in his treatment.


1 R. A. Fisher, “The Goodness of Fit of Regression Formulae and the Distribution
of Regression Coefficients,” Jour. Roy. Stat. Soc., 85, Part IV, July (1922).

2 “On the Correction Necessary for the Correlation Ratio,” Biometrika, 14 (1923).

3 John Blakeman, “On Tests for Linearity of Regression in Frequency Distributions,”
Ibid., 4 (1905-06).

4 “The Probable Error of a Mean,” Ibid., 6, No. 1 (1908).

5 See Kelley, Truman L., Statistical Method, 1923, Formula [309].

6 “On the Correction Necessary for the Correlation Ratio,” Biometrika, 14 (1923).

7 John Wishart, “A Note on the Distribution of the Correlation Ratio,” Ibid., 24
(1932).








